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PREFACE 


Learning occupies one of the central positions in psychology, and an under- 
standing of the psychology of learning should be useful to almost every 
serious student of psychology. The aim of the first edition of this book was 
to bring together for the student the divergent interests in the psychology 
of learning. This, likewise, is the aim of the new edition. In other respects, 
however, the two editions are very different. The first edition was a Survey; 
this edition is less so. The present version treats fewer separate topics and 
treats these in greater detail. It is more theoretically directed than was the 
first edition and more selective. The more intensive coverage is achieved 
by eliminating topics that are of little or no fundamental importance and 
by eliminating peripheral topics which are generally more adequately 
covered in other courses. As before, however, the author has strenuously 
tried to resist the inclination to ride special hobbyhorses; the material is 
selected, but it is selected in such a way as to give the student as representa- 
tive a picture as possible of the contemporary concern with the Psychology 
of learning. 

As in the first edition, the early chapters attempt to cover the basic prob- 
lems of conditioning and learning. The material in these chapters deals with 
the acquisition of individual responses, and there is little attention given in 
these chapters to the interaction between responses. The basic controlling 
Variables— reinforcement, drive, frequency, etc. are discussed in these 
chapters. The middle chapters provide a bridge to problems in learning which 
center around interactions of learned responses. Chaining, or serial learn- 
ing, and transfer of training are examples of the topics covered. The later 
chapters deal with retention, problem solving, and the other processes under 
the control of implicit and mediating activity. 

The approach of the book is largely empirical, but it has been impossible 
(and undesirable) to avoid a theoretical bias, however slight. It is easy to 
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detect, and instructors will not find it difficult to present somewhat different 
theoretical positions without interference from the book. To the informed 
reader the influence of B. F. Skinner, E. C. Tolman, and the late C. L. Hull 
is obvious in the early chapters. Perhaps less obvious is the fact that the 
author is considerably indebted for the point of view in the later chapters 
to G. A. Miller, C. N. Cofer, C. E. Osgood, and others who have recently 
worked so intensively on the role of coding and mediating activity in the 
human intellectual processes. 

Again, as was the case in the first edition, there has been some attempt 
to compromise between what experimental psychologists find fruitful (or 
easy) to investigate in the laboratory and what students see as more funda- 
mental problems. Although there has been very little effort to treat “ap- 
Plied” problems as such, the relation of basic concepts in the psychology 
of learning to things outside the rat laboratory and the memory-drum cubi- 
cle is stressed throughout the book. The author has the fond hope that one 
day the psychology of learning will return to its original position as one of 
the basic sciences underlying educational psychology and similar applied 
fields. There is no doubt that some sections of this book have been influ- 
enced by that hope. 

Although the book is principally addressed to the junior or senior stu- 
dent in psychology, graduate students in Psychology and in educational 
psychology will find it useful. The book is relatively self-sufficient and can 
be studied with profit even by a student with only the slightest background 
in psychology. It is the author’s hope, at the same time, that it will provide 
a foundation for further study, particularly the study of theoretical issues 
which can only be understood with some background in the basic empirical 
Problems in the psychology of learning. 
ধু Numerous people have read all or portions of the manuscript. To these 
individuals the author gives his thanks. Many of the suggestions made by 
Teaders have been incorporated into the book; in other cases the author, 
perhaps rashly, has decided to let matters stand— partly because he wanted 
the book to reflect the contemporary scene in the psychology of learning as 
faithfully as possible. Special thanks are due to the author's wife, Ellin 
Deese, who carefully read many versions of the manuscript and who typed 
the larger portion of the final draft. Also, a special word of thanks is due 
to Clifford T. Morgan, whose very considerable editorial talents include the 
ability to teach others some of the skills of writing. 

Permission has been granted by many publishers and individuals to re- 
Produce illustrations or passages, and the author gratefully acknowledges 
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these permissions: American Journal of Psychology; the American Psy- 
chological Association for its several journals; Appleton-Century-Crofts, 
Inc.; Cambridge University Press; Ralph Gerbrands; Harvard University 
Press; Journal of General Psychology; J. B. Lippincott Company; Ronald 
Press Company; F. Loren Smith; Teachers College Contributions to Educa- 
tion; University of California Press; John Wiley & Sons, Inc. In each case 
the original source has been cited with the material reproduced. 


James Deese 
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CHAPTER 1 


INTRODUCTION 


Most human behavior is learned. This fact in itself makes clear why the 
study of learning is one of the most important parts of psychology. It is im- 
possible to understand the behavior of human beings and that of most ani- 
mals without knowing something about the basic Principles of learning. In 
Order to have a thorough understanding of almost any problem in psychol- 
0gy, we should know what effect learning and different learning experiences 
have upon the particular problem we are studying. For example, if we were 
interested, as personality theorists are, in the effects of traumatic experi- 
ences during infancy or childhood upon adult personality, we should dis- 
cover that we could understand this problem only to the extent that we 
understood the basic principles of learning. If we knew as much about 
learning as we should like, we could answer questions about the extent to 
Which traumatic experiences can affect adult life and the degree to which 
different segments of adult behavior may be affected. In short, it is not diffi- 
Cult to prove the importance of learning in psychology. 

It is probably more difficult to convince the student that learning theory 
is of enough general interest and importance to warrant study. Theory, even 
to the fairly sophisticated student of psychology, often means fussy dis- 
putes about matters of no consequence. We must admit that there is a 
certain justice in this view. This should not blind us, however, to the impor- 
tance of theory, both in the understanding of learning itself and in its appli- 
Cations to the study of personality and abnormal behavior. 

Theories serve several useful functions. They supply hypotheses about 
unsolved problems and serve to integrate and lend coherence to a scientific 


« discipline. They help us to organize factual information and to understand 
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its various applications to other areas. It is usually more reasonable to 
formulate a hypothesis about personality development, for example, from 
a well-established theory of learning than from a simple factual rule. This is 
true because a theory frequently suggests why a particular application 
should be made, while the extension of a factual rule, more often than not, 
is simply a guess. 

There is no better way to appreciate the value of theory than to read a 
survey of the experimental literature about some aspect of learning. If we 
do this, we shall find hundreds of experiments which overlap, duplicate, 
and frequently contradict one another. With no theory to guide us, we 
should find it difficult to tell important from unimportant experiments. If 
we were to memorize the contents of such a survey, we should have at our 
command a mass of highly specific information limited to narrow examples 
of behavior. Most general principles are established through theory of some 
sort. Therefore a theory can be used to lift us beyond the confines of iso- 
lated experiments into an awareness of general principles about learning 
and behavior. 

Unfortunately there are very few theoretical principles in the psychology 
of learning that are both unequivocal and widely accepted, because very 
few existing theories are supported by a wide variety of evidence. We must 
therefore be cautious in the extent to which we apply theory. It is necessary 
to strike a balance between the simple factual information at our disposal 
and the general principles developed or invented to explain and extend it. 
Since complete devotion to a given theory is likely to be dangerous given 
our present psychological ignorance, we should be cautious in accepting 
a universal set of explanatory principles that can be applied to all problems 
in learning. Rather, we shall study some limited aspects of certain theories 
and their applications to specific problems. Some of the grand sweep of a 
really systematic theory is lost in this way, but, considering the limitations 
of our factual knowledge, this approach is more realistic. 


SOME METHODOLOGICAL BACKGROUND 


Before we can discuss the factual evidence and theoretical principles of 
the psychology of learning, we need to establish some ground rules about 
finding evidence and discovering or inventing theoretical Principles. We 
shall now look briefly at the nature of experiments and the logic of scientific 
method as they apply to some of the problems we shall examine later. 
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Experiment and Observation 

A scientific observation occurs when one looks at an event and records it 
in such a way that any other competent observer, given the same conditions, 
would make the same report. An experiment, in the simplest sense, is an 
Observation made under conditions that are, to some extent, of the ob- 
Server's own devising. In an experiment the investigator can actually change 
and control the events that occur. Thus, watching squirrels retrieve buried 
nuts from the ground could be a scientific observation, but to turn this into 
an experiment, we should have to control some of the conditions of ob- 
servation (such as the place where the nuts are buried). 

Most of the factual information in the psychology of learning has been 
gathered in experiments, because, whenever possible, investigators prefer 
to experiment rather than simply to observe. This preference is well 
founded; it is always easier to infer causal relations when the events under 
Study are controlled. The kind and extent of the control in the observation 
limit the adequacy and generality of the causal inferences the investigator 
may make. If only one condition is systematically varied, we can be a good 
deal more certain of the cause than if all conditions were allowed to vary 
haphazardly. Since we are usually interested in making causal inferences 
from our observations, it will be worthwhile to glance at some of the rules 
for conducting experiments. 

Independent and dependent variables. There is a particular way in 
Which events are controlled in an experiment. In the simplest kind of ex- 
periment, all conditions are held constant but one, and this is allowed to 
Vary in known ways. For example, we may want to study the effects of 
different kinds of rewards upon the speed with which monkeys can learn a 
new habit. To do this, we should systematically vary the nature of the 
Teward given to different monkeys and try to keep constant in all monkeys 
the effects of motivation, previous history, testing conditions, etc. Since we 
had controlled all the variables but reward, we should be safe in-making 
inferences from our results about the effects of quality of reward upon 
Speed of learning. 

In this example, kind of reward is an independent variable because it is 
independently controlled and varied by the experimenter. The changes in 
behavior that result from changes in reward provide the dependent variable. 
Speed of learning, in this example, is the dependent variable because 
Changes in it depend upon changes in the independent variable, reward. 
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Independent variables, then, are the conditions that are systematically 
varied by the experimenter; the dependent variables are the consequent 
changes in behavior. 

More than one independent or dependent variable. Tn the example just 
mentioned, all the variables but kind of reward were controlled, either by 
being held constant or by being allowed to vary randomly from animal to 
animal. In some experiments, however, we may use more than one inde- 
Pendent variable and measure more than one dependent variable. 

Experiments with more than one independent variable are quite com- 
mon. Investigators frequently use this method because they suspect that the 
two independent variables somehow modify each other’s influence on the 
dependent variable. In these circumstances, we speak of the result as an 
interaction between the two variables. For example, reward and motivation 
interact in their influence on behavior. Under very low or under very high 


motivation, animals will frequently not discriminate among rewards of 
Slightly different quality. But under moderate motiva' 


hunger, for example—animals will discriminate am 
ent rewards. Thus, it is easy to see that the effect 
is modified by the effect of motivation on behavior. 
Sometimes, too, we may use more than one de 
periment. We may be interested in the influence 
of learning and general activity. In this case we 
should not get by making two separate experime 
ing and one on general activity level. The ad 
quite in the same form as that achieved in th: 
Variables; usually it comes in the form of a c 
dependent variables. Speed of learning and gene: 
correlated, for example, under the influence 
or motivation, that we Suspect there is some 


accounts for the changes in both speed of lea: 
Functional relationships. 


tion—a mild degree of 
ong qualitatively differ- 
of reward on behavior 


Pendent variable in an ex- 
of reward upon both speed 
Obtain information that we 
nts, one on speed of learn- 
ditional information is not 
© case of two independent 
orrelation between the two 
Tal activity may be so highly 
of different kinds of reward 
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because they show at a glance the nature of a functional relationship. In 
graphic presentation the independent variable is always placed on the 
horizontal axis (the abscissa) and the dependent variable on the vertical 
axis (the ordinate). A good example of such a graph is Figure 4 on page 21. 
Here the magnitude of a conditioned response (dependent variable) is 
Plotted against the number of trials of training (independent variable). 


Definitions in the Psychology of Learning 


The term reward that we have been using is a common word in our 
vocabulary. It means something to everyone, but, unfortunately, nearly 
everyone interprets it in a slightly different way. If a term is to be useful 
to the experimental psychologist, it must be capable of exact definition; 
that is, it must refer eventually to a measurable fact—a concrete object or 
condition. Many words that refer to psychologically important events are 
also used in an everyday sense. The common definitions of these terms, even 
When explicitly set forth in a good dictionary, depend upon casual observa- 
tions, tradition, and the opinion of authorities. Of course, there can be as 
many definitions of this sort as there are dictionaries. 

Operational definitions. In order to make our psychological language 
as explicit as possible, we must redefine many familiar words as well as 
introduce some new words. Then we shall have to give these words clear 
operational meaning. An operational definition of a term, in the simplest 
Case, is a description of the operations performed by an experimenter or 
Observer and the changes in behavior that result. Desire, reward, motives, 
etc., can all be given surprisingly precise definitions if described in this way. 
A hungry rat, according to the operational definition commonly agteed 
upon by psychologists, is one that has been deprived of food for a period 
Of time after having been on an accustomed feeding schedule. 

By the same token, an operational definition of the term reward usually 
states that it is some change in the environment (presenting an animal with 
food, for example) that increases the likelihood of the animal repeating the 
behavior that immediately preceded the reward. Notice that there is no 
mention of “something that the animal likes” or “something that it wants 
to get”; rather, the definition is explicitly stated in terms of things that can 
always be observed and agreed upon. We may all disagree about what ani- 
mals “like” (unless we operationally define “like”), but most of us can 
agree that sometimes animals repeat what they have done before because 
it was followed by a reward. As a matter of fact, this is the fundamental 
aspect of voluntary learned behavior. 
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Hypothetical constructs. Of course, not all the terms used by psycholo- 
gists are given direct operational definitions. Someone, for example, Nay 
say that “learning is an increase in the excitatory potential for a given 
habit.” Excitatory potential in this definition is a hypothetical construct; 
it is not something that can be directly observed. Excitatory potential is a 
theoretical term invented by a psychologist to account for certain changes in 
behavior. Psychology abounds with these hypothetical constructs; as a 
matter of fact, the word learning itself is usually used in the sense of a 
hypothetical construct. Hypothetical constructs are useful (1) when they 
can be related to things that are Operationally defined and (2) when they 
have wider application than an Operational definition. 

The second requirement of a hypothetical construct calls for some ex- 
Planation. Let us take the example of reward again. Reward is occasionally 
used as a hypothetical construct rather than an Operationally defined term 
(though more usually the term reinforcement is substituted for reward). In 
Such a case it may be defined by saying that it is a “slight decrement in the 
level of neural excitability.” If we define reward as a hypothetical construct, 
Wwe must show that such a definition is useful. It would be useful if it could 
serve to relate our operational definition of 4 reward (in terms of change of 


behavior) to some other conditions. In this case we might argue that the 


hypothetical construct serves as a Suess as to why the behavior changes. If 


this is so, we ought to be able to draw a number of implications from the 
hypothetical definition of reward and put them to experimental test. If the 
construct stands up under test, we might conclude that we h 
tion” of the operational fact of the effects of reward on beh 
see that hypothetical constructs are potentially very useful 
Unfortunately it is sometimes very difficult to Put hypothetical constructs 
to a test (the above example is particularly bad in this respect) 
quently psychology abounds in constructs that are potentially 
but are only really so if we are willing to accept a gre 
sumptions. Nevertheless, we still find it necessary to m 


in the following discussion, even 
well 


ad an “explana- 
avior. Thus, we 
as explanations. 


» And conse- 
explanatory 
at many dubious as- 


ake use of constructs 


though we know that they are not always 
grounded in experimental fact. 


Some Psychologists make 
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and dependent variables. One example of an intervening variable that we 
shall see much of in this book is strength of response. Strength of response 
is not a dependent variable, but it determines what will happen to de- 
pendent variables. Thus, if we suspected that quality of reward affected 
Strength of response, we should have no way of measuring this effect 
directly. We could, however, measure the effect of quality of reward upon 
the likelihood of future occurrences of a response and upon the magnitude, 
Or size, of future responses. We should expect both likelihood and magni- 
tude to increase with preferred rewards. We can then use the single term 
Strength of response to cover both changes in probability of occurrence 
and size of responses. 

If we make a hypothetical construct out of the notion of strength of re- 
sponse, we add something to its definition that cannot be directly referred 


to a collection of observations. We might say, for example, that strength of, 


response is ultimately defined in terms of the configuration of protein mole- 
cules on the nerve membrane. It is unlikely that we shall ever directly ob- 
serve such configurations of molecules, but such a definition could have 
important implications for our interpretation of experimental facts. 

This distinction between hypothetical constructs and intervening vari- 
ables may seem abstract and, indeed, it is. When dealing with the intrica- 
cies of psychological theory, however, it is sometimes helpful, and we shall 
have occasion to use it in many connections. While intervening variables are 
useful in limited psychological theory, whenever a theoretical psychologist 
tries to relate the theoretical terms of psychology to neurophysiological or 
Physical events and theories, he makes use of hypothetical constructs. 

If we keep in mind the distinction between hypothetical ‘constructs and 
intervening variables, as well as the distinction between these and opera- 
tional definitions, we shall find it much easier to keep fact and theory 
separate and to evaluate theory against available factual evidence. 

The remainder of this chapter discusses an important operational dis- 
tinction between two types of behavior and two types of learning. Since 
there is a question as to whether or not this distinction involves theoretical 
as well as operational matters, the importance of keeping operational and 


theoretical terms separate should be apparent at the outset. 


TWO TYPES OF LEARNING 


Learning to typewrite is different from learning to ski, if for no other 
reason than that different movements and muscles are involved. Learning to 
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typewrite and learning to ski, however, are both examples of the same 
Processes of learning at work. In both Cases, initial attempts are corrected 
in response to information the learner receives about their success. Most 
Psychologists, however, are now convinced that not all learning involves 
the same basic Processes, though they do not agree about how many proc- 
esses there are. Some Psychologists have Suggested several types; Tolman 


(1949), for example, suggested six different learning processes. While few 
theorists agree with this Proposition, it is cl 


tional distinction between at least two typ 
the behavior of complicated organisms 
One is concerned with more Primitive orga 
more types (Humphrey, 1933). At least 
Sary, however, and a great many investig 


ear that one must make an opera- 
es of learning when talking about 
like dogs and men. Perhaps, if 
nisms, it is necessary to add even 


common, it is apparent that many psychologists 
types of learning, have been driving at 
amine two cases of learnin 
Principles. 


Two Examples of Learning 


Every student of Psychology is fa- 
onditioning made by Ivan P. Pavlov, 
V'S experiments are both well known 


e discussing, We Can use one of them 
Arning. 
In this experiment (Pavlov, 1927), 


1 For example, see Skinner (1938), Schlosber 


1937), Hi fi 
Movwrer (1947), Konorski (1950), Stephens oe. ), Hilgard ater 0uis (1940), 


A 
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Pavlov referred to the meat powder as an unconditioned stimulus. An 
unconditioned stimulus is any stimulus that has the ability to elicit a re- 
sponse without prior training.> Thus, when meat powder is placed in the 
mouth of a dog, a salivary reflex is immediately produced. This is called an 
unconditioned response. The tuning fork was referred to by Pavlov as a 
conditioned stimulus. Conditioned stimuli, in Pavlov’s experiments, are 
those which initially do not elicit the response under study but which come 
to do so by being paired with the unconditioned stimulus. Likewise, the 
Salivary response produced by the tuning fork is called a conditioned 
response because it is a learned response conditional upon the previous 
Pairing of stimuli. y 

This basic experiment of Pavlov’s has had an enormous influence upon 


the psychology of learning, and the terms unconditioned and conditioned 


stimuli and unconditioned and conditioned responses are part of the basic 
Vocabulary of the psychology of learning. Let us now look, however, at an 
example of learning that is quite different from the one studied by Pavlov. 

Conditioned bar pressing in the rat. Our second example is perhaps not 
quite so familiar, but it is equally important. This time we shall deal with 
the behavior of the laboratory rat, and as in the case of Pavlov’s experiment, 
this example will serve as a prototype for an important class of learning 


Processes. 
A white rat is placed ina l 


Proof. At one end of this box is a small lever, or bar y 
Wall. This bar is connected.to an automatic recording device and to a maga- 


Zine filled with small pellets of rat food. When Ee bar is pressed, a pellet 
Of food is dropped into the box from the magazine. The essentials of this 
device, usually called a Skinner box, can be seen in Figure 1 (Skinner, 
1938). HE - 

If the rat is hungry when it is placed in the box, it will explore the box 
quite readily. It will sniff the air, paw the walls of the box, bite here and 
there. Eventually, because there is little else to do, the rat will manipulate 
the lever. Sooner or later it will depress the bar enough to release a pellet 
Of food. The rat may not discover the food immediately, but when it does, 
0 fairly certain that the rat will press 


2-inch-square box that is relatively sound- 
that projects from the 


it will certainly eat the food. It is als 


2 The ability of the unconditioned stimulus to elicit a response may be because of 
learning that took place before the animal came to the laboratory. Operationally, how- 
ever, the “unconditioned” ability of a stimulus refers to the state of things at the be- 
Binning of an eriment. Obviously, if one wishes to state that a particular uncon- 
ditioned stim as an innate eliciting tendency, he must carefully control the life 


history of the animal. 
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the lever a second time. This time, perhaps, it will discover the pellet of 
food immediately. At this point the behavior of the rat will change dra- 
matically. Instead of resuming its casual exploration of the box, it will now 
Proceed to press the lever repeatedly. As a matter of fact, if it is hungry 
enough, it will press the lever at 2 rate determined only by the time it takes 
to put its nose down and eat the food. 

It is apparent that the rat has learned Or has been conditioned tc press 
the bar. Is this example of learning like the one studied in Pavlov’s labora- 
tory? To answer this question, let us see if we can make use of Pavlov’s 


the animal depresses the lever, 
(Skinner, 1938.) 


arises when we try to 
that the unconditioned 


the sight of the bar OF t 


he tactual Stimulati 
wall. What, then, i 


bar after it has “ 
Again, we are reduced to a guess. 


AVe a response (pressing the bar) tH 
aa fk g hat dra- 
matically changes its Character after the rat has been rewarded a few times 
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for performing the response. In short, we have a response which increases 
in likelihood as the result of reward. The more common psychological term 
for reward in this situation is reinforcement. We may diagram the sequence 


of events this way: 


Response Reinforcement Increase in strength 
y — — —> 
(lever-pressing) (food) of response 


The reinforcement produces the increase in the strength of the bar-pressing 
Tesponse. | | 

A comparison of the two examples. There are two important difter- 
ences between the two experiments we have just discussed. In the salivary- 
conditioning example, the response is directly under the control of the in- 
vestigator. Provided that the animal is in normal condition, the salivary 
response can be elicited simply by providing the appropriate stimulus. In 
the case of the lever-pressing example, however, the HVESEBEE cannot 
directly elicit lever pressing; rather, he must wait until it occurs spontane- 
ously. | 

The second difference is in the action of the reinforcement. Pavlov used 
the terms reinforcement and unconditioned stimulus interchangeably. This 
Usage has caused some confusion, though in a SETA: 4 is justified. The 
unconditioned stimulus is a reinforcer in Pavlov's experiment HetuSE with- 
Out it no conditioning would occur; it reinforces conditioning. The two 


and unconditioned stimulus, are not synonymous, 
a 


terms, reinforcement | 
the term uncondi- 


however, for we have just learned that we egnnot use ! 
apply to the reinforcement in the bar-pressing example. 
a 


tioned stimulus to 
»s the response, and we do not know 


The reinforcement in this case follow bh) 
What the stimulus is that leads to the response. Thus another difference be- 


tween Pavlov’s experiment and the bar-pressing experiment is that in 
t the unconditioned stimulus is also the reinforcement, 
essing experiment no unconditioned stimulus can be 
nforcement which follows the response can usually 
yyJov’s experiment the animal is reinforced whether 
ed response, while in the bar-pressing example the 


Pavlov’s experimen 
Whereas in the lever-pr 
identified, though a rei 
be easily identified. In P 
Or not it gives a condition a f 
animal is reinforced only when a response occurs. This is a very important 


difference. 


aes SEE ppes of Learning 
Principles Distinguishing the Two Types of 8 


We have looked at examples of two types of learning. Let us now turn 


to the basic principles that distinguish these two types; we have already 
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been introduced to some of these Principles by comparing the two examples 
with each other, but in this section we shall make the principles more ex- 
plicit and general. 

Operant versus respondent behavior. Ina long series of papers and in 
his book The Behavior of Organisms (1938), Skinner has pointed out that 
Wwe have not only two basic types of learning but also two basic kinds of 
responses. These two types of responses Skinner called respondents and 
Operants. Respondents are acts or behavior directly elicited by stimuli; they 
Obey the classical physiological laws of reflexes, and, indeed, the terms re- 
spondent and reflex are identical for all practical purposes. Operants, on the 
other hand, are acts or behavior for Which there are no readily observable 
external stimuli. Examples of Operant behavior are the familiar ones of 
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pendent upon the occurrence of the conditioned response. This is the case 
in the bar-pressing example, and indeed in any case in which an animal is 
“trained to do a trick.” Most of our attempts to teach children and animals 
are of this type; we make a reward or reinforcement contingent upon the 
behavior we want the subject to learn. 

Two types of behavior and two types of learning. In the example from 
Pavlov, the response was a respondent and the conditioning classical. In 
the example of the rat pressing the lever, the response was an operant and 
the conditioning instrumental. Do these correlations between respondent- 
classical conditioning and instrumental-operant conditioning always hold? 
Probably not, though in practice classical conditioning nearly always occurs 
with a respondent, and instrumental conditioning with an operant, There 
are, however, apparently mixed examples, and while we shall not deal with 
these at present, they will become important when we come to theoretical 
matters. 

There is considerable dispute as to whether the operational distinctions 
between two types of behavior and between two types of learning are basic 
theoretical distinctions as well. Are these differences limited to our tech- 
niques for investigating behavior, or do they reflect some fundamental 


differences in the processes governing behavior and learning? There is a 


distinction of hypothetical constructs that corresponds to the operational 
nd cases which are borderline in 


distinctions we have been discussing, a 
Operational distinctions become very important in deciding whether or not 
theoretical distinction should be made. These are problems, however, that 


We can defer for a while. 


CONDITIONS CONTROLLING SIMPLE LEARNING 


We have seen two examples of elementary kinds of learning. Such ex- 
amples are usually called conditioning. There is no special way in which 
ntiated from learning, but the term usually 


conditioning can be clearly differe 
ch simple responses are acquired under 


refers to cases of learning in whi 
Simple or restricted conditions. Some investigators (Mowrer, 1947) would 
like to limit the term conditioning to the Pavlovian or classical variety, but 
Such a restriction is difficult to maintain, since, as we shall see, some more 


Complicated examples of learning involve elements of both classical and 


instrumental conditioning. Furthermore, the two types obey many of the 
same laws. In this book, the word conditioning will be used to refer to iso- 
lated responses learned under simple conditions which preclude previous or 
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concurrent learning that would either help or interfere with the learning 
of the response under study. These elementary examples are extremely 
interesting to the Psychologist, because in them he Can see at work basic 
Processes that may be obscured in more complicated cases. 

Variables that determine the nature of conditioning. In the next few 
chapters, we shall look at Some of the variables that determine the course 


reinforcements are given for 

every response), time relationships between stimuli and between stimuli and 
responses, etc. 

Theories about conditioning. 

various conditions determine the 


£ such predictions; 
» Since theories are frequently not stated 
clearly enough to make unequivocal predictions about behavior. In one 

ove wrong, since they make clear and 
at a atively easy to tes টু 


CHAPTER 2 


REINFORCEMENT AND LEARNING 


asic concepts in the psychology of learning. 
In practice it often means the same thing as the term reward. Reward is too 
general in meaning for our purposes, however; also, it is debatable whether 
its ordinary meaning applies to reinforcement in the case of classical condi- 
tioning. Therefore we shall not make much use of the term reward; instead 
We shall substitute the more exact term reinforcement. 


Reinforcement is one of the b 


AN OPERATIONAL DEFINITION OF REINFORCEMENT 

Recall, for a moment, the example of instrumental conditioning given in 
In this example, a laboratory rat learned to press a bar, 
a pellet of food. The food was clearly 
hich the rat pressed the bar, for 


the first chapter. 
and this response was reinforced with 
responsible for an increase in the rate at w 
We know that without it the rat would only occasionally have pressed the 
a few pellets of food in this way, however, it pressed 


bar. Having earned 
as it could, pausing between responses only 


the bar just about as rapidly 


to eat the pellets. | 
If, instead of following the bar-pressing response with a pellet of food, 


We had used the sound of a buzzer, would the rat have learned to press the 
bar? Probably not, our common experience with rewards would tell us 


(though, it should be remarked, sometimes our common experience can 


lead us to the wrong conclusions). Here, however, our common sense would 


be right; in this situation food is a re 
1s reinforcing because it can produce an increase in the frequency of the 
response that it follows. Obviously, because the buzzer cannot produce 


inforcement and a buzzer is not. Food 


Such an increase, it is not a reinforcement. 
15 
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A definition of reinforcement. With this example in mind, we can now 
look at a formal definition of reinforcement. A reinforcement is any stimulus 
event that will increase or maintain the Strength of a response or stimulus- 
response connection associated with it. A reinforcement in the case of 
operant behavior is a stimulus that follows the Occurrence of the response 


and rewards it; a reinforcement in respondent behavior is the stimulus that 
elicits the unconditioned reflex. 


The one feature of this definition that ha: 
the phrase strength 0f response. Stren 
vening variable (see Chapter 1), 


S not yet been made explicit is 
gth of response is Properly an inter- 
something that cannot be directly ob- 
th of a response from some aspect of 


- Thus, in order to specify the formal 
definition of reinforcement completely, we need to Say exactly what de- 


pendent variables can be used to indicate the strength of response. 
Strength of response. The most im 


Problems created 


strength of response 


‘ In the case of operant behavior 
and instrumental conditioning, the methods of assessing the effects of rein- 
forcement are quite simple. Usually the 


» AS it is at the Outset of the ex- 
the result is a nearly hori- 
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Zontal line with occasional bumps which show the occurrence of responses. 
When the response becomes conditioned, however, the response occurs 
much more frequently, so that the line goes up rapidly. The actual learning 
itself, in this example, is shown by the change in the slope of the curve. 

In classical conditioning the situation is usually more complicated. As 
explained in the first chapter, the reinforcing stimulus in the classical condi- 
tioning of a respondent is the unconditioned stimulus. Thus, the stimulus 
that elicits the response in the first place also provides the reinforcement. 
The conditioned or learned response, however, occurs to the conditioned 
stimulus, and because the conditioned stimulus usually precedes the un- 
conditioned stimulus, the conditioned response usually “anticipates” the 
unconditioned stimulus. This is not necessarily the case, however, for it is 


50 


RESPONSES 


60 


TIME IN MINUTES 
mental conditioning. All responses were reinforced. There 


Figure 2. A lati' rve of instru! 
oe Rea 3 but the fourth is followed by a rapid 


is no evidence of learning following the first three responses, 
change in rate. of responding. (Skinner, 1938.) 


Possible to condition a respondent when the unconditioned and conditioned 
stimuli are simultaneous. If they occur simultaneously, then we must use 


Special “test” trials to find out whether conditioning has taken place. With- 


Out these trials we should have no way of knowing whether a particular 
Tesponse was the result of the conditioned or of the unconditioned stimulus. 
Tf we make the two stimuli simultaneous and then test for a conditioned 
tesponse by omitting the unconditioned stimulus on some trial, we can de- 

ave a conditioned response. According to the 


termine whether or not we h 2p? 
definition of reinforcement, however, prior association of the conditioned 


Stimulus-response connection with the unconditioned stimulus or reinforce- 


Ment is necessary, so that if we omit the unconditioned stimulus on too 
Many successive test trials the conditioned response will disappear. Inci- 
dentally, in the example of salivary conditioning, the indicator of strength 
Of Tesponse is usually one of magnitude of response—the amount of saliva 


Silven on any trial. 
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Trans-situational reinforcers. The definition of rEInfOrcEniEnt given 
above is open to the charge of being circular—it is only a definition, not a 
law. There are laws about reinforcement, however, and one of them is that 
most reinforcers for operant behavior are trans-situational (Meehl, 1950). 
This means that most reinforcements will strengthen all or most learnable 
responses in a given species. In other words, if a particular stimulus 
event is reinforcing for a bar-pressing response in the rat, it ought to be 
reinforcing for other responses, such as running a simple maze. Turning out 
a light is reinforcing for the laboratory white rat. This reinforcement can 


be used to teach a rat to press a lever that turns out the light or to run a 
simple maze that has a dark box at the end. 


Respondents are trans-situational in the sense that one can use almost 


any conditioned stimulus for a given response, but because of the elicited 
character of the respondent, the unconditioned stimulus cannot be used 
a reinforcer for more than a few closely related respondents. Thus 
spondents are not trans-situational so far As responses are concerned. 

Meehl (1950) Proposes that all reinforce 
trans-situational and that every increase in strength of response or learning 
involves the use of a trans-situational reinforcer. This is probably not ex- 
actly true, for later in this book there are examples of Operant reinforcers 
that are specific to certain responses, but it is Probably true that most of 
the reinforcers of everyday life are trans-situational. This means that one 
can use almost any reinforcement to teach al 
of the reinforcement for ap 


particular response-reinforce 


as 
re- 


rs (at least for operants) are 


CONDITIONS OF REINFORCEMENT 


Let us now look at Some of the experiments that tell us 


tions under which reinforcement occurs and how these c 
the effects of reinforcement. These experiments 
tell us how behavior is det 
either are held const 
is such an essential 


look upon this Secti 
learning. 


about the condi- 
onditions modify 


are important because they 
ermined by reinforcement when other variables 


Ant or interact with reinforcement. Since reinforcement 
1 mechanism in the behavior of Organisms, one can 
On as a fundamental introduction to the Psychology of 
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Strength of Response and Number of Reinforcements 


Strength of response increases with increasing numbers of reinforced 
Occurrences of that response. Thus, the strength of the conditioned salivary 
response is low after only a few reinforcements; but after a large number 
Of reinforcements, say 100, it is close to maximal strength. Thus, it is 
apparent that there is a summation of the effect of reinforcement from one 
Occasion to the next. Let us take a brief look at how this summation takes 
place. 

Latent period and the number of reinforcements. Latent period is one 
Of the indicators of strength of response. Latent period, or latency, is the 
time between the presentation of a stimulus and the beginning of a response. 
Thus it is the same as reaction time. Latent period is inversely related to 
the strength of response; when latent period is short, response strength is 
high; when latent period is long, response strength is low. 

Several experiments show how the number of reinforcements affects 
latent period. Let us take one by Graham and Gagne (1940) as an example. 
These investigators prepared a runway for rats that was about 3 feet long 
and narrow enough to prevent the rats from turning around. At one end 
Was a goal box containing food; at the other, a starting box with a door 
leading to the runway. The door could be raised or lowered by the experi- 
Menter. Raising the door constituted the stimulus (note that since running 
is an operant, the stimulus in this case does not elicit the response). The 
latent period was the difference between the time the door was raised and 
the time the rat bevsan to run down the runway to obtain the food at the 
end. The rats were tested a number of times; each time they were allowed 
to enter the goal box and eat the food. 

Figure 3 shows what happened to the latent period. (Note that the latent 
Period is plotted logarithmically.) It gradually grew shorter. The largest 
decrease in log-latent period came between the first and second reinforce- 
Ments. Thereafter, between each successive reinforcement there were smaller 
decreases in the latent period until it hardly diminished at all. From these 
data, we can infer that the first reinforcement increased response strength 
the most and that each reinforcement thereafter added less to the total re- 
Sponse strength. This inference depends upon the assumption that strength 
of response is an inverse linear function of log-latent period. 

Magnitude of response and number of reinforcements. Magnitude of 
Tesponse is a second measure of the strength of a response. Let us take a 
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and the other of which does not. These other examples would show char- 
acteristics much like those of the two cases we have cited. Response 
strength would receive its greatest increment with the first or one of the first 
reinforcements, and with successive reinforcements the increase in response 
strength would become progressively smaller. This uniformity in simple 
learning curves has led psychologists to believe that there is a fundamental 
learning process that can be mathematically expressed as a theoretical learn- 
ing curve. Let us glance at the properties psychologists have proposed for 
such a curve and the equation that governs it. 


MAGNITUDE OF RESPONSE 
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Figure 4. Amplitude of the galvanic skin response as a function of the number of reinforcements. 


(Hovland, 19370.) 


A theoretical curve. The exact shape of the fundamental learning curve 
—the one relating response strength to number of reinforcements—has 


Occupied the interest of many theoretical psychologists. Investigators have 
tried to predict the shape of simple learning curves from mathematical 
assumptions and then to apply these predictions, in turn, to more compli- 
cated relationships in psychology. We shall now examine one of the most 
widely accepted of these attempts to discover the basic properties of the 


fundamental learning curve. 
Here are three of these basic properties that should be expressed in any 


mathematical or theoretical learning equation: 
1. Strength of response is an increasing function of the number of rein- 


forcements or reinforced trials. 
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2. Strength of response increases only to some limit; beyond this there is 
no further increase unless the conditions of the experiment change. This limit 
or asymptote may vary with different situations and different species of 
animals, but with the same animals and the same situation, it is a constant. 


3. Each successive reinforcement adds smaller and smaller amounts of 
response strength. 


With these characteristics in mind let us consider how strength of re- 
sponse R grows with number of reinforcements N. With each increase in 


number of reinforcements, there is some increase in strength of response. 
We can put this in the form of a ratio, 


AR 

AN 
where /\ means increase or increment. This expresses characteristic 1. 
Characteristic 2 says that strength of response can increase only to some 
limit. Let us call this limit M. Characteristic 3 Suggests that as strength of 
response gets close to the value of M, the increases in strength of response 
grow smaller. Thus as the difference M — R becomes smaller, AR be- 
comes smaller. All of this can be put in the form of an equation: 


ANE 
AN = KkK(M — R) 


This equation says that the increment in strength of response is proportional 


to the difference between the maximum strength possible and the amount 
dlready accumulated. As this difference bec 


strength becomes zero—no further increase 


The above equation may be written as a differential equation and in- 
tegrated. This yields 


Omes zero, increase in response 
is possible. 


R= M — Me-*# 
‘To those with little or no mathematics, this equation may not mean much. 
Suffice it to say that it describes learning curves of the sort presented earlier 
very well. 


This equation, or something very close to it, has frequently been a basic 
assumption in theories of learning (Hull, 1943; Estes, 1950; Estes and 
Burke, 1953; Bush and Mosteller, 1953). Sometimes it is elaborated to 
Provide for the possibility that the first reinforcement may not always pro- 
duce the greatest increase in Strength, that there may be a warm-up period, 

1 Some investigators have proposed 


quite different properti ij i 
curves. See, for example, Gulliksen (1934). fins Ll 
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so that the largest increments come after the first few reinforcements. In 
this case the curve will be S-shaped. Despite the possibility of minor modi- 
fications, the ideas presented above tell us about the basic elements that go 
into theoretical learning curves. Sometimes these elements are described in 
terms of hypothetical neural events or samples of stimuli conditioned on 
each trial, but in any case the characteristics outlined remain the same. 


Amount and Type of Reinforcement and Strength of Response 


The fundamental fact about learning is that behavior changes progres- 
Sively as a function of the number of reinforcements. This relationship, 
however, is modified by a number of conditions of reinforcement, such as 
the amount and quality of the reinforcing agent. In the case of food rein- 
forcement, for example, a large amount of food results in a greater strength 
Of response than a small amount (Grindley, 1929). In itself this is trivial, 
for it is limited to the kind of animals used in the experiment (chickens, in 
this case), the particular response, and the kind of reinforcement. Other 
experiments, however, give us grounds for believing that this relationship 
is Senerally true for simple conditioned operants and food reinforcement. 
Guttman (1953), for example, finds that it is true of the lever-pressing 
Tesponse in rats for various concentrations of sugar and that it is also true 
Of peanuts used as reinforcement for problem solving in monkeys (Harlow 
and Meyer, 1952). 

Even if we use a large experimental literature to establish a general law 
about strength of response and amount of reinforcement, it seems rather 
trivial, if for no other reason than that it appears to be obvious. There is 
Something about this relationship that is not quite so obvious, however, 
and that is the way in which amount of reinforcement affects behavior. The 
exact way in which this happens involves an important distinction in the 
Psychology of learning. 

The distinction between learning and performance. For some time psy- 
chologists have been aware that there could be a difference between what 
an animal learns and what it does. It is possible, for example, that a rat 
Could learn that pressing the lever in the Skinner box leads to food but 
Would not show that it had learned, perhaps because it was afraid of the 
Noise the food-delivery apparatus makes. Or a rat may know the location 
ঃ food in a maze, but because it is not hungry, it may not show any par- 
ticular disposition to go to the food. Thus, the fact that an animal does not 
Perform very well in a learning test does not mean that the animal has not 
learned what it should. It is possible, however, to devise special tests that 
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make it possible to tell whether an animal has learned or not, even when its 
performance is poor. A 

This distinction between learning and performance is important to the 
question of how amount of reinforcement affects behavior, for there is a 
good deal of evidence to suggest that amount of reinforcement affects per- 
OHA but not necessarily learning. 

Two investigators (Crespi, 1942, 1944; Zeaman, 1949) have attempted 
to separate the effects of amount of reinforcement on learning from the 
effects on performance by (1) training rats to run a simple runway, such as 
the one described earlier in this chapter, under various amounts of food 
reinforcement, and then (2) switching the amount of reinforcement when 
the rats had reached a stable level of performance. The results of these 
studies are well defined. The amount of reinforcement has no effect on the 
rate at which animals approach the final level of performance, but it does 
affect the final level they achieve. In other words 
plete, animals that have received different 
perform at different levels, but they do not le. 
limit of response strength M is affect 


but the rate at which this limit is re: 
trains a ra 


» after learning is com- 
amounts of reinforcement do 
arn at different rates. Thus, the 
ed by the amount of reinforcement, 
ached, k 


t with one amount of reinforcemen 
a higher amount, the ch 


» is not. Furthermore, if one 
t and then suddenly shifts to 
ange in performance is almost instantaneous—the 
rat does not have to learn to appreciate the new amount of reinforcement; 
it produces an instantaneous change in the rat’s behavior. 

Both of these studies used a ve 
rats had to do w 


all. For example, in teach- 


ing rats to discriminate between a black goal box a 


Placing food in one and not in the other, it was di 
of food makes no difference in either le. 
1949). In another example amount of fo 
ence in performance in a complic 


running response, show little or n 
the runway case, the measures of 
formance than of learning. There i 
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the more complicated examples, because the animals have relatively difficult 
problems to learn, the measures of behavior more nearly reflect learning. 
Because the behavior reflects more learning than performance, amount of 
reinforcement does not appear to change behavior very much. If no food 
at all is given, however, performance deteriorates badly. 

There are several other studies in the experimental literature (Maher and 
Wickens, 1954; Reynolds, 1950a, 1950b) which support the conclusion 
that learning is usually independent of amount of reinforcement while per- 
formance may not be. There is, however, a fundamental problem that is 
raised by one of these studies. 

In this study the investigator tried training rats in a maze with no rein- 
forcement at all. It would seem reasonable, by extending the principle that 
Wwe have already stated, to say that the absence of reinforcement affects 
Performance, not learning. This is fundamental because it suggests that rein- 
forcement is necessary for performance but not for learning itself. This 
Problem is so basic and so controversial that it has not yet been fully settled. 
For years investigators have been trying to find ways to decide whether 
reinforcement fects only what an animal does or both what it does and 
What it learns. One of the ways in which they have tried to reach a decision 
in this matter is to compare learning under some reinforcement with learn- 
ing under no reinforcement. Such experiments are called latent-learning 
experiments. 

Latent learning. The latent-learning experiments grew out of the theo- 
ries of E. C. Tolman (1932, etc.). One of Tolman’s principal ideas was 
that reinforcement —or reward—affected performance but had little or 
Nothing to do with learning. The latent-learning experiments were designed 
lo test this idea, and one of the earliest was made by Blodgett (1929). 

Blodgett allowed one group of hungry rats to explore a maze that did 
Not have food in the goal box. This group of rats, therefore, was not rein- 
forced, argued Blodgett (though this argument is suspect, as we shall see). 

Second group of rats explored a maze with food in the goal box, and 
therefore it received the conventional reinforcement. 

There was a difference between the behavior of these two groups. The 
Sroup which was reinforced with food seemed to learn on schedule, but the 
Unreinforced rats showed little improvement. After seven days, the rats in 
this Sroup were entering almost as many blind alleys as they had on the first 
day. At this time, however, Blodgett put food into the goal box for these 
rats. Their performance suddenly and dramatically improved, so that almost 
‘Immediately they made as few errors as did the previously reinforced rats. 


26 THE PSYCHOLOGY OF LEARNING 


These results can be seen in Figure 5. They show us that the rats which 
did not receive the food as reinforcement were able to profit from their ex- 
perience in the maze, though the results of such experience did not show up 
until a reward was provided. Because the learning that apparently took 
place during the unreinforced trials was not evident until food was intro- 
duced, the learning was said to be “latent.” 

Many investigators have repeated this or similar experiments (Thistle- 
thwaite, 1951), but not all investigators have been able to find what 
Blodgett found. A careful repetition of Blodgett’s study (Reynolds, 19456), 
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anxiety-provoking for the rats. 


ৰ that Blodgett’s experiment is rather like the studies on amount 
Of reinforcement. The group that did not receive food was Weakly reinforced 
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and did not improve very much until given a strong reinforcement; then 
improvement occurred on one trial. Thus, Blodgett’s study suggests, as do 
the studies on amount of reinforcement, that reinforcement affects per- 
formance, but not rate of learning. 

Two more experiments on latent learning deserve mention. In these some 
of the critical weaknesses of Blodgett’s study are removed. In one (Buxton, 
1940), rats were permitted to live in a maze for a period of several days. 
They were placed in the maze at different positions and removed from it 
at different positions. Consequently, the animals explored all parts of the 
maze to about the same extent, and they were not differentially rewarded by 
being removed from any particular part of the maze. 

The important result of this experiment is that it was not necessary to 
Live the rats one rewarded trial before the sudden improvement in per- 
formance occurred. Instead of allowing the rats to run through the maze to 
discover the food the first time, as Blodgett had done, the experimenter put 
them directly into the goal box, so that they had a chance to see the food. 
When they were put at the beginning of the maze, most of the rats were 
able to find their way to the goal box without making many errors. 

A second experiment (Seward, 1949) confirms this result, and these two 
experiments taken together show that it is possible for rats to learn the 
Seneral plan of a simple maze when they are not differentially reinforced 
in any particular part of the maze. Though again, until the food is intro- 
duced as a reinforcement, performance does not show what the rat has 
learned. 


Time of Reinforcement 

Another important variable in the instrumental learning of operant be- 
havior is the time between the response and the reinforcement. This is not 
important in classical conditioning because there is no well-defined way of 
Varying the time between the conditioned response and the reinforcement, 
though we may vary the time between the conditioned stimulus and the 
Teinforcement (unconditioned stimulus). We shall examine this problem 
later. For the present we shall restrict ourselves to examples of operant 

ehavior. 

In an experiment by Perin (1943) we can see the basic nature of the rela- 
tionship between time of reinforcement and strength of response. Perin 
Used a Skinner box and rats. He slightly modified the usual procedure in 
the Skinner box so that a movement of the lever either to the right or left 
Would produce a pellet of food. After the animals had learned to move 
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the rod in both directions and had displayed their preference for one direc 
tion, the apparatus was changed so that a movement of the rod in the 
preferred direction would give no food. A movement of the rod in the other 
direction, however, would cause the food pellet to be delivered. 

The most important factor in this experiment is that the time between 
response and reinforcement was varied, ranging between 0 and 30 seconds 
for different rats. As illustrated in Figure 6, the results were clear. The 
sooner the reinforcement was delivered, the higher the rate of learning. As 
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delay between a response and a reinforcement can be extended to approxi- 
mately 20 minutes and rats will still learn (Wolfe, 1934). We shall deal 
With this problem shortly. 

The other problem created by this experiment is indicated by the label 
“Slope of learning curve,” on the ordinate of the graph in Figure 6. Now 
this plotting suggests that time of reinforcement, rather than amount of re- 
inforcement, affects both the limit and the rate of learning, and, indeed, this 
is the conclusion of an important theorist (Hull, 1951). Let us look at the 
implications of this finding. 

Time of reinforcement and learning versus performance. Two experi- 
ments subsequent to Perin’s show that the situation is complicated. Logan 
(1952) tried an approach similar to that used in the learning versus per- 
formance problem pertaining to amount of reinforcement; he trained rats 
to a steady level of performance under two different delays of reinforce- 
ment and then switched these delays. Initially, performance was poorer for 
a long delay than a short one. Animals that had been trained on a long delay 
Sradually improved in performance when switched to a short delay, while 
the converse was true of those switched in the opposite direction. The fact 
that the change was gradual and not immediate suggests strongly that delay 
Of reinforcement affects both learning and performance. A later experiment 
by Seward and Weldon (1953) shows essentially the same result. 

These findings pose a problem since most theories find it difficult to ac- 
cept the fact that amount of reinforcement affects only performance while 
delay of reinforcement affects both learning and performance. No simple 
explanation of this is possible, but let us look at a somewhat rudimentary 
Version of one possible answer (Seward and Weldon, 1953). 

One of the things animals have to learn in psychological laboratory tests 
is the relationship between what they do and the presence of the reinforce- 
ment. Put anthropomorphically, sometimes it seems as if the animals have 
a different notion about what they are supposed to be doing than the experi- 
mMenters do. Skinner (1948) has pointed out that animals sometimes per- 
form responses which are not necessary to Obtain the reinforcement; this 
Skinner called “superstitious” behavior in animals. For example, a rat may 
Appear to have the idea that it is necessary to turn around once counter- 
Clockwise before pressing the lever. From Skinner's demonstration it ap- 
Pears likely that these false correlations between behavior and the reinforce- 
Ment occur most frequently when reinforcement is delayed. In other words 
there is more uncertainty about which response is being reinforced when 
there is a delay between a particular response (such as lever pressing) and 
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the reinforcement. Since the reinforcement is sometimes connected by the 
animal to incidental behavior, the learned dependency between reinforce- 
ment and the response that actually produces it is weaker. 

One implication of this explanation is that we Should reconsider the 
relationship between amount of reinforcement and learning. It is all right to 
say that any amount of reinforcement will have about the same effect on 
learning as any other amount, but Any response the animal learns will prob- 
ably be weaker without reinforcement than it would be if it were reinforced. 
To put it anthropomorphically again, an animal might learn many things 


because they have equally interesting consequences, but because the 
learns these things more or less simultaneously, 
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learned without the Original unconditioned stimulus, Pavlov called a higher- 
order conditioned response, but today we generally speak of this as second- 
Ary reinforcement. 

An experiment on maze learning provides an example of secondary 
Teinforcement in instrumental behavior (Saltzman, 1949). In this experi- 
ment rats were trained to run down a straight runway to a goal box that 
Contained food. The goal box was either black or white. In one condition, 
the goal box always contained food when it was black but did not when it 
Was white. After this training, the rats were taught a simple maze in which 
they had to choose between two pathways, one leading to a black goal box 
and the other to a white one. The rats learned to £0 to the black goal box, 
even though it was never baited with food. Thus it is clear that the black 
Soal box itself was reinforcing because it had been paired with a primary 
reinforcement, food, in another situation. The black goal box was thus a 
Secondary reinforcement. 

A general definition of secondary reinforcement is that it is any stimulus 
that acquires reinforcing power because it is paired with a primary or un- 
learned reinforcement. It is easy to see that the concept of secondary rein- 
forcement is important, for it accounts for learning in which there seems 
to be no reinforcement based upon a known biological drive. However, we 
May ask, what does secondary reinforcement have to do with the time be- 
tween a response and its reinforcement? Since this is an important question, 
let us examine it in detail. 

Time Of reinforcement and secondary reinforcement. The Principle of 
Secondary reinforcement enables us to describe why delaying reinforcement 
Only sometimes has a deleterious effect upon learning. We mentioned earlier 
An experiment on lever pressing in which learning practically disappeared 
if the reinforcement was delayed much more than half a minute, while in 
Another experiment on maze learning the reinforcement could be delayed 
for almost half an hour and the rats would still learn. Let us compare 
the two. 

In the maze learning experiment (Wolfe, 1934) the animals had to learn 
to go to one arm of a T-shaped maze. The goal box at the end of this arm 
Contained food, and the identical goal box in the other arm did not. The 
Maze had doors just before the goal boxes, so that the animals could be 
Tetained in the arms of the T before they were allowed to enter the goal 
box, Thus the arms of the T served as retention chambers. 

It is the retention chamber that explains why delay did so little harm to 
learning in this situation. Since the retention chamber in one arm always 
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immediately preceded food, it came to be a specific secondary reinforce- 
ment. Thus, no matter how long the animals were delayed before they 
entered the goal box, they could learn to turn to the side with food— simply 
because of the secondary reinforcement associated with this particular 
retention chamber. 

In the lever-pressing experiment (Perin, 1943), the experimenter went 
to a good deal of trouble to eliminate all secondary reinforcement. He built 
the apparatus so that the rod was removed immediately after the rat had 
pressed it; this stimulus, which might have become a secondary reinforce- 
ment, was therefore never present when the rat actually got the food (except 
when the delay between response and reinforcement was zero). Thus there 
was minimal opportunity for the lever to become a secondary reinforcement. 
As a result, the amount of delay between the primary reinforcement and the 
response had to be short if the animal was to learn, for there was little or 
no secondary reinforcement to bridge the gap. 

The gradient of reinforcement. 
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ested in learning. Although it is certainly true that the question of why 
Teinforcement occurs is important, we have deferred discussion of this 
Problem because many basic issues are still unsettled. On several important 
matters we cannot even state any very firm conclusions. Some questions, 
however, have been settled by experimental work, and theorists have come 
to some agreement on still others. Let us look at the basic issues and see 
how some of them have been tentatively settled, while others have remained 


Controversial. 


Basic Issues in Theories of Reinforcement 


In the following section we shall examine some of the more important 
issues involved in the various theories of reinforcement and the attempts 
that have been made to resolve them. Two problems are of paramount im- 
Portance. One is the question of why certain stimuli appear to be rein- 
forcing and others do not. The other is whether reinforcement determines 
learning or performance. In addition there is a subsidiary problem of 
Whether or not the same principles of reinforcement apply to both re- 
SPondent and operant behavior. 

Why are some stimuli reinforcing? It is fairly clear that there are only a 
limited number of stimuli that are reinforcing to every animal. Animal 
trainers know this, of course, and they are careful to reward the behavior 
they want to teach their animals with the proper reinforcement. If we are 
teaching tricks to dogs, a simple pat on the head will reinforce most of the 
time, though food is probably better. With the domestic cat, a pat on the 
head is usually not enough. Indeed, it is possible that the difficulty people 
have in teaching cats may be due in part to a more limited class of rein- 
forcing stimuli. 

Although it does not sound very enlightening to say that reinforcing 
Stimuli are things that animals like, this common-sense definition contains 
the germ of some of our most important theoretical ideas about reinforce- 
ment. 

The Origins of present interpretations of why some stimuli reinforce ani- 
Mals and others do not probably come from the theory of natural selection. 
Darwin and his successors tried to explain the process of species evolution 
in terms of natural selective breeding. The adaptive characteristics of or- 
Sanisms survive, said Darwin, and the unadaptive ones die out. Naturalists 
have Pointed out many examples of anatomical and physiological character- 
Stics that are adaptive and that seem to be perpetuated through selective 
breeding. Thus in evolution, biological changes in organisms have “good” 
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and “bad” effects, and, according to the natural selection theory, only the 
ith “good” effects survive. 

Lao apps to many early workers in evolution that behavioral pat- 
terns also provide good examples of the survival of adaptive mechanisms. 
Furthermore, in the lifetime of an individual there appears to be a process 
of selective adaptation. Organisms seem to learn those things which are 
useful—they learn the location of food supplies, water, hiding places, etc. 
The early literature of comparative psychology abounds with anecdotes 
illustrating the survival value of the things animals learn. 

Thus, the argument goes, adaptive behavior (that which preserves the 
animal from harm and keeps it fed and sheltered) survives and becomes 
learned, whereas unadaptive behavior does not. This concept was combined 
with the hedonistic ideas of Utilitarianism. Pleasure, it was held, is asso- 
ciated with adaptive behavior and pain with nonadaptive behavior. Thus 
Pleasure and pain were brought into an adaptive theory. 

This kind of notion—a behavioral hedonism combined with the sur- 
vival of adaptive traits—has had an enormous influence on the theory of 
learning and behavior. It is hard to find theorists who deny that reinforcing 
stimuli are connected with adaptive mechanisms in the animal. Many 
theorists deny the universal importance of adaptive reinforcement, how- 
ever, and others say that, while it determines what the animal will do, it 


has nothing to do with what the animal learns. Thus we face once more this 
issue of learning versus performance. 


Reinforcement in learning and performance. 


The question of whether 
reinforcement determines learning or merely pe 


rformance is a relatively 
new one. Most of the older investigators did not make a distinction. E. L- 


Thorndike, one of the first experimental psychologists to explore the adap- 
tive characteristics of reinforced behavior, never clearly stated whether he 
thought that reinforcement affected performance or both learning and per- 
formance. In his Principle of Effect (1898), which led to our current opera- 
tional notions of reinforcement, he said that rewards tend to stamp in 


behavior. This and other of his statements imply that reinforcement acts 
upon both what the animal learns and what 


k it does—indeed that reinforce- 
ment Is necessary for learning. 


This implication from Thorndike had considerable influence, and for a 
long time the dominant view was that an ada 


1 Ptive reinforcement was neces- 
sary for both learning and performance. Hull (1943) was the most influ- 
ential theorist to adopt this viewpoint. He held that adaptive reinforcement 
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Was necessary for the learning of both operant and respondent behavior. 
Hull was a quantitative-learning theorist and wrote nearly all his hypotheses 
as equations. It is interesting to note that near the end of his career Hull 
drastically revised his theory by rewriting his equations so that performance 
but not learning was determined by reinforcement (Hull, 1951). 

The need-reduction theory of reinforcement. Hull's original belief in 
the importance of reinforcement for both learning and performance led him 
to speculate on the mechanism of reinforcement, and his conjectures have 
Provided a specific theory that is valuable whether reinforcement affects 
learning and performance or just performance. This is the need-reduction 
theory of reinforcement. 

The need-reduction theory states that reinforcements are always related 
to specific motive states in animals and men. They reduce the level of or- 
ganic or secondary need derived by a process of learning from some organic 
need. Thus food is reinforcing for a hungry animal, just as water is for a 
thirsty one. Other drives, sex, for example, are also reduced by appropriate 
reinforcement. Most experimental work, however, tends to deal with hunger 
and thirst since both the level of drive and its appropriate reinforcement 
Are easier to control. 

There is little doubt that drives and their reduction control the strength 
Of responses. Whether all behavior is directly or indirectly controlled by 
Need reduction is impossible to answer, but there is experimental evidence 
Of the control exerted by factors such as reduction of hunger. An experi- 
ment by Miller and Kessen (1952), for example, shows that milk intro- 
duced directly into the stomach of rats by way of a fistula can serve as a 
reward in maze learning. The rats in this experiment could not taste the 
milk, so it was obvious that the milk was not reinforcing because of its taste 
Or because of “consummatory activity”; furthermore, a control experiment 
in which saline was used instead of milk made it clear that the critical factor 
is not just something in the stomach but rather the nutritive value of food. 

Not all behavior is controlled by primary reinforcement, according to this 
theory, For, of course, secondary reinforcements can be built upon primary 
reinforcement. Secondary reinforcements, moreover, involve secondary 
drives (Miller and Dollard, 1941). It is uncertain whether or not secondary 
teinforcements reduce secondary drives as primary reinforcements do 
Primary drives. In the case of fear (a secondary drive) it is thought that 
escape from fear (a secondary reinforcement) does result in the reduction 


Of the Secondary drive, but this seems to be a somewhat special case, so we 
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cannot make any general statement from it. It is quite possible, however, 
that secondary reinforcements do involve some reduction in a need anal- 
ogous to the reduction of a primary drive by a primary reinforcement. 

In recent years investigators have found evidence for a particular system 
in the brain that, when stimulated, reinforces behavior (Olds and Milner, 
1954). Since such stimulation does not reduce a biological need, it is argued 
that need reduction itself is not necessary for reinforcement. In the usual 
course of things, however, need reduction may activate this reward system 
in the brain (Olds, 1955). Although need reduction is what normally dic- 
tates animal behavior, it is not the essential element. Perhaps activation of 
a particular part of the brain is the important factor. 


Furthermore, there is reason to believe that there is a direct sensory, 


or hedonic, kind of reinforcement (Young and Shuford, 1954). That is to 
Say, some things are reinforcing because they have the right kind of sensory 


consequences, not because they satisfy some biological need. Thus saccharin 
is reinforcing for rats (Sheffield and Roby, 
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taste receptors. 
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the same time that we present a conditioned stimulus. Because the con- 
ditioned stimulus is simultaneous with the response, it comes, through 
association, to elicit that response. 

This association theory can be applied directly to classical conditioned 
responses. It also applies to instrumental conditioned responses, even 
though we cannot always produce experimentally stimuli that are simulta- 
neous with the response to be conditioned. Suppose, for example, we are 
trying to teach a dog to retrieve something when we say, “Go get it, boy.” 
We can get dogs to retrieve easily enough, but it is much more difficult 
to get them to retrieve only on a given signal. Let us see how this kind of 
Association theory says the process occurs. 

The best-known advocate of this kind of association theory has been 
E. R. Guthrie, who has this to say about association: “A combination of 
stimuli which has accompanied a movement will on its recurrence tend to 
be followed by that movement” (Guthrie, 1935, 1952). For Guthrie the 
classical conditioned response is the model for all learning processes. In 
most cases, however, we cannot observe conditioned stimuli (for example, 
the rat pressing the bar in the Skinner box); and so in Guthrie's theory 
they take on the character of hypothetical constructs. That is to say, we 
invent some conditioned stimuli for a particular set of learned responses. 
In the case of lever pressing, the conditioned stimuli are kinesthetic, accord- 
ing to Guthrie (1952). In other words, the rat makes a series of movements, 
these stimulate the sense organs of the muscles, and this combination of 
Muscular sensations serves as a conditioned stimulus. 

Thus, according to this particular version of association theory, if we 
Wish to teach someone an instrumental response, we try to arrange it so 
that the stimulus and the response occur together. Obviously with such 
@ principle of learning the distinction between classical and instrumental 
Conditioned responses becomes purely a matter of convenience. Guthrie 
(1952) believes that this distinction may even be a bit pernicious, since 
it tends to make us accept instrumental behavior as purely “spontaneous” 
and prevents us from looking for stimuli which might cause the behavior. 
| Strangely enough, however, Guthrie's major experimental study is on 
Instrumental learning. Guthrie is dissatisfied with the typical experiment 
On instrumental behavior, because, he claims, it examines the results of 
behavior, not the behavior itself. Take the lever-pressing case, for example. 
In the Skinner box we find out how many times per minute the rat presses 
the lever; ordinarily we are not interested in the way in which the lever is 
manipulated by the animal. Guthrie objects to this approach, for he believes 
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that it is the association of particular movements of the animal with stimuli 
that is really important and that the result of the movement is only second- 
ary. Consequently in an experimental situation, Guthrie and Horton (1946) 
studied the way in which cats learned to escape from a puzzle box. Instead 
of merely recording the time it took the cats to get out of the box or the 
number of times they escaped, Guthrie and Horton Observed the cats in 
action and took motion pictures. One of the things that impressed them 
was the degree of behavioral stereotypy. In many cases the cats’ move- 
ments were almost identical from trial to trial. This indicated to Guthrie 
that there was a close association between the cues and the movements 
involved. 

One of the implications of Guthrie’s notion of association of stimulus 
and response through contiguous exposure is that it takes only one pairing 
for conditioning to occur. This seems to £0 against everything we have 
said thus far, since curves show that learning is a gradual process. Guthrie 
has a good answer for this: An Ordinary instrumental act (such as lever 


Pressing or escaping from a puzzle box) involves many different move- 
ments. Thus it may take many trials for the 
movements that is eventuall 
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We shall have occasion to examine some of the mathematical properties 
Of this theory of Estes and Burke later. At present we should note that it 
Suggests, as does its parent theory by Guthrie, that all conditioning is essen- 
tially classical and that the distinction between classical and operant con- 
ditioning is only an experimental convenience, not a basic difference. Both 
Of these theories assign all conditioning to the classical variety because they 
Suggest that the basis of learning is the establishment of a correlation be- 
tween some novel stimulus and a response it has not hitherto evoked. By 
confirming this new stimulus-response connection enough times we can 
make sure that the elements of the stimulus population and the components 
of the response are sufficiently associated to produce efficient performance. 
We must refer to this association as a correlation of stimulus and response, 
because in instrumental behavior we cannot always cause the association 
to be present; we are only capable of devising a situation in which there 
Will be a correlation between a stimulus situation and a “spontaneous” 
Tesponse. Thus, this particular variety of association theory says that all 
learning is exemplified by classical conditioning, and only our lack of con- 
trol over unconditioned stimuli for instrumental or operant behavior pre- 
Vents us from doing experiments on instrumental behavior that are exactly 
Parallel to those on respondent behavior. 

In this kind of association theory, therefore, the function of reinforce- 
Ment is simply the mechanical one of providing an end to the stimulus- 
Tesponse sequence. Reward serves to prevent the animal from unlearning 
What it has already learned by keeping it from reacting in other ways to 
the stimuli that lead to the response. Food at the end of a maze keeps the 
Animal there instead of allowing it to wander through the maze unlearning 
Proper turns and learning new turns to some blind alley. Thus, the rat eventu- 
ally does learn the maze. Reward only serves to keep the rat from unlearning 
What it has learned. Incidentally, this suggests that the rat would learn the 
Maze if we simply removed it from the goal box at the end of the trial 
‘Instead of giving it food. Seward (1942) tested this notion with the Skinner 
box that is, he simply removed the animals from the box after they pressed 
the lever— and they did learn, though not so well as food-rewarded animals. 

It is difficult to see how the stimulus-response association theory could 
Predict performance in latent-learning cases without some additional prin- 
ciples, for latent-learning experiments show that animals learn more than 
they perform during the latent phase. This particular association theory, 
AS We have seen, puts its greatest emphasis upon performance—the move- 


ents made by the animal. 
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i i i us to 
Another association theory. The topic of latent learning brings 
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One of the arts in the control of behavior is knowing the range of reinforce- 
ments possible for a given animal and how to administer them for optimal 
results. Classical conditioning, of course, depends upon the reinforcing 
Property of the unconditioned stimulus. As does instrumental behavior, a 
classical conditioned response tends to disappear if it is not occasionally 
reinforced with an unconditioned stimulus (see Chapter 3). There is no 
doubt about the importance of the reinforcement operation in the produc- 
tion and control of behavior. 

There is now strong evidence, however, that reinforcement is not neces- 
Sary for learning, although it does probably modify learning. Thus rein- 
forced learning in a complicated problem, such as learning a long maze, is 
likely to be different from unreinforced learning. This is not because rein- 
forcement is necessary to learning, but because reinforcement indirectly 
Modifies what animals learn by determining what they do. 

Thus, although we described earlier how learning increases as a function 
Of number of reinforcements, we should now modify this to see how learn- 
ing increases as a function of the number of exposures or experiences in 
the learning situation. Usually we reinforce successive tries, so that an in- 
Crease in the number of reinforcements is correlated with the change in 
behavior, The essential requirement for a series of changes in behavior is 
Simply that the animal behave in a certain way on successive exposures to 
the situation. All that is necessary to maze learning is that the animal run 
to the goal box a certain number of times. We can usually accomplish this 
most efficiently by reinforcing the animal. Thus the important variable in 
learning is the number of trials which are given an animal on a particular 
Problem, 

The distinction between classical and instrumental conditioning. We 
have seen in Guthrie’s stimulus-response association theory that the distinc- 
tion between classical and instrumental conditioning is a matter of con- 
Venience. Other theorists, however, maintain that there is a fundamental 
difference between them. Mowrer (1947), for example, points to the fact 
that the reinforcement in classical conditioning is not need-reducing, while 
SUCh reinforcement is usually necessary to produce instrumental or operant 
Pehavior, Other investigators have pointed out that classical conditioning 
1s Characteristic of Sipe reflexes, and instrumental conditioning of volun- 
tary behavior. Certainly, at the present time, we cannot come to a definite 
Conclusion as to whether the distinction between classical and instrumental 
conditioning is real or merely convenient. This problem will come up in 
later chapters, and we shall see that it is of great current interest. 
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TEMPORAL ASSOCIATIVE FACTORS IN CONDITIONING 


Since reinforcement is not essential to learning, the most fundamental 
conditions we can describe are associations— associations between stimulus 
and stimulus, stimulus and response, and response and response. Associa- 
tion means that two events, stimuli or responses, must occur close together 
in time. We have already examined the data on response-reinforcement 
associations, but not the stimulus-stimulus relationships in classical condi- 
tioning. In this section we shall look at temporal factors in classical SO 
ditioning. These are of basic importance because classical conditioning 1s 
the simplest example of associative learning. 

Pavloy's studies on time relations in conditioning. Pavlov (1927) used 
two different techniques in the study of time relations in classical condition- 
ing. In one, the conditioned and unconditioned stimuli overlapped each 
other in time. The conditioned stimulus came either before the uncondi- 
tioned stimulus or the two were simultaneous, but the essential condition 
was that they overlapped. In the other technique, the two stimuli did not 
overlap. Usually the conditioned stimulus was presented first, and then, 
after an interval, the unconditioned stimulus. 

Pavlov thought that simultaneous presentation of the conditioned and 
unconditioned stimuli produced the most rapid conditioning, and in all 5 
experiments in his laboratory, they were presented in this way. Following 
this, he gradually lengthened the interval between the conditioned and un- 


conditioned stimuli, or allowed longer exposures of the conditioned stimulus 
before he presented the u 


this to mean that the conditione 
Tesponse was actively inhibited. He used 
4s an example of what he called “ 
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These observations are important in the present context because they 
indicate associations between conditioned stimuli and conditioned Tesponses 
to be extraordinarily flexible. Such associations do not have to be based on 
simultaneity Or, indeed, upon close temporal association. Pavlov could 
introduce delays of many minutes in the trace conditioning procedure and 
Still produce a conditioned response; the conditioned stimulus would come 
On briefly and then, after a wait, the conditioned response would occur. For 
the production of such delays it is apparently essential that nothing else 
Occur between the conditioned stimulus and its response. Pavlov’s experi- 
Ments were performed in sound-deadened rooms in order to control the 
Stimuli impinging on the animal, so that the animal would not be stimulated 
in any way between the occurrence of the conditioned stimulus and the 
time for the conditioned response. Such an absolutely controlled environ- 
Ment seems to be necessary for very long delays. The animals themselves 
Came to inhibit activity in the time interval, and in some instances appeared 
to go to sleep. Thus, as appears to be the case with delay between responses 
and reinforcement in instrumental conditioning, if nothing interferes with 
the Association the delay can be bridged. 

Rate of learning and delay of the unconditioned stimulus. A number of 
‘vestigators have studied the rate of conditioning as a function of the time 
interval between the conditioned and unconditioned stimuli. In these studies 
the highest rate of conditioning occurs when the onset of the conditioned 
Stimulus just precedes that of the unconditioned stimulus (Spooner and Kel- 
logg, 1947; Kimble, 1947; Moeller, 1954). These studies were all per- 
formed with human subjects; the conditioned responses were variously a 
hand movement, the blinking of the eyelid, and a galvanic skin response. 

spite these differences, the studies all agree that the highest rate of con- 
ditioning occurs when the conditioned stimulus precedes the unconditioned 
Stimulus by approximately half a second. Figure 7 shows the data on the 
Conditioned hand movement, together with some data from earlier studies 
Of the same response (Wolfie, 1930, 1932). Rs 

One Curious case of the time relation between the conditioned and un- 

Conditioned stimuli is shown in Figure 7. This is backward conditioning: In 
ackward conditioning the unconditioned stimulus precedes the conditioned 
Stimulus, Pavloy thought that conditioning could not take place in this case; 
OWever, the data presented in Figure 7 show some evidence for backward 


TOT He experiment by Kimble (1947) is not strictly an example of classical condition- 
in. শী f je 
8, though it is sufficiently close to be considered so for present purposes. 
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conditioning. It is probably not true associative conditioning, however, but 
more likely a sensitization of the whole response system. The unconditioned 
stimulus (an electric shock in the cases in which backward conditioning 
is found) sensitizes the subject so that he will give a response to almost 
any stimulus. This interpretation agrees with the data of Spooner and Kel- 
logg (1947). They find that the number of backward conditioned responses 
is high early in training but decreases during it. This is because the subjects 
readily adapt to stimuli other than the shock. 
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STIMULUS INTERVALS (SECONDS) 
Figure 7. Results of the Spooner and Kellogg (1947) and the Wolfle (1930, 1931) studies on the 


time interval between conditioned and unconditioned stimuli. The curve labeled “present study" is 


from the data of Spooner and Kellogg. Notice that the maximum frequency of conditioned responses 
occurs when the conditioned stimulus precedes the unconditioned stimulus by a fraction of a second. 
Backward conditioned responses rarely occur. 


Rate 0f conditioning and association. All studies show that conditioning 


is less efficient when the stimuli are presented together than when the con- 
ditioned stimulus precedes the unconditioned stimulus by a brief interval. 
This fact is fundamental to the interpretation of the associative processes. 
The association is not always between the stimuli, but may be between one 
stimulus and the trace, or aftereffects, of another. Just why asynchrony is 
best is not clearly understood at present. The brevity of the optimal time 
interval between the onset of the conditioned stimulus and unconditioned 
stimulus suggests that this effect has to do with the time of events in the 
central nervous system. The important fact, however, is that the rate of 
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conditioning drops rapidly as the time interval between the stimuli is in- 
creased to even a few seconds (Moeller, 1954). In trace conditioning this 
interval may be extended, but only through tedious training under condi- 
tions in which there is a minimum of external stimulation. Thus while the 
duration of the stimulus trace is probably long, it is easily obscured by other 
stimuli, and it is therefore difficult to obtain the kind of associations char- 
acteristic of classical conditioning except with very short time intervals. 


CHAPTER 3 


EXTINCTION AND 
PARTIAL REINFORCEMENT 


When learned acts are no longer reinforced, they usually diminish in 
strength. Frequently, for example, colicky babies learn to cry in order to 
obtain extra attention. If this happens, parents are sometimes advised to 
avoid picking up the baby when it cries so that it will stop crying for atten- 
tion. It is doubtful whether many parents last out the process, but the advice 
itself is testimony to the common knowledge of the effects of withdrawal of 
reinforcement. 

The effects of removing reinforcement from a particular bit of behavior 
are quite complicated. The general effect—diminished strength of response 
caused by a failure of reinforcement—is called extinction, and in the first 
part of this chapter we shall look at the way in which this is modified by 
different variables. In later sections we shall examine the more general prob- 
lem of the effects of intermittently given reinforcement —we shall see what 
happens when a given response is reinforced some, but not all, of the time. 


EXTINCTION 


In the simplest examples of learning, a response is reinforced every time 
it occurs. Thus, each time the rat in the Skinner box presses the lever, it 
receives a pellet of food. After the rat has learned about the correlation be- 
tween food and pressing the lever, it will continue to press the lever at a 
steady rate as long as it remains hungry. If the reinforcement is discon- 
tinued, the rat will continue to respond for a while, but at a gradually 
diminishing rate and with increasing irregularity. Figure 8 shows how with- 

46 
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drawal of reinforcement affects rate of responding in the Skinner box. From 
this figure it is clear that the rate of responding is high at the beginning 
(right after reinforcement is withdrawn) and that it gradually diminishes, 
so that by the end of the first hour without reinforcement, the rat is going 
long periods of time without giving a response. It is also clear that the rate 
of responding is quite irregular during extinction (compared with the highly 
regular rate under reinforcement shown in Figure 2). The rat may press 
the lever rapidly for a few minutes and then cease to press it altogether for 


a period of time. 
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Figure 8. An extinction curve for the leve ule 
t the beginning, but then it 


responses is plotted on the ordinate. The rate of responding is high a! 
begins to decline and become more irregular. Long periods of time pass without responding, until, 


finally, the animal almost completely stops responding. (Skinner, 1938, data from F. S. Keller and 


A. Kerr.) 


In the same way, the classical conditioned response gradually grows 
weaker if the unconditioned stimulus is removed. Thus, in Pavlov's experi- 
ment presented in the first chapter, if the tuning fork is sounded a number 
Of times without being paired with the meat powder, the salivary response 
to the tuning fork gets smaller and finally disappears. 

Resistanceto extinction. TEiSapparent Homithetwo,eXamples we lave 
Just looked at that reinforcement creates a Teserve of responses that can be 
emitted by the animal after the reinforcement is withdrawn. This reserve, 
Or potential for responding in the absence of reinforcement, is EEL called 
resistance to extinction OF, sometimes, the reflex reserve (Skinner, 1938). 

Resistance to extinction depends, to SOM ‘exten Pon a HUNSELOS 
reinforcements given an animal before extinction is begun. Figure 9 shows 
that if a small number of reinforcements is gIven, few bar-pressing responses 
are emitted in extinction before animals cease responding altogether. As 
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the number of reinforced responses is increased, however, the number of 
responses in extinction also increases. But the number of TESPODES INCTEASES 
only up to a limit. Thus, for low numbers of reinforcements, resistance to 
extinction depends upon the number of reinforcements that have been SIVEn 
before extinction is begun. Maximal resistance to extinction, however, is 
obtained with relatively few reinforcements, and after a certain point one 


can go on giving reinforcements without appreciably increasing resistance 
to extinction (Skinner, 1938). 


NUMBER OF RESPONSES TO EXTINCTION 
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Figure 9. Number of lever responses during extinction as a function of the number of reinforce- 
ments administered during training. (Redrawn from Perin, 1942; data from Perin, 1942, and 
Williams, 1938.) 


Time and Effort in Extinction 


Two variables of considerable theoretical importance in extinction are 


(1) the distribution of responses in time and (2) the effort required to 
make responses during extinction. These are linked here because together 
they provide information important to the theory of extinction. Let us first 
look at the problem of the distribution of responses. 

Distribution of responses in time. 


One of the main advantages of giving 
discrete trials during an experiment 


On learning is that the experimenter 
Can control the rate at which animals respond. In the Skinner box, as it 
is usually used, the rate of performance depends upon the animal and such 
indirect means of control as hunger. Many experimental ex: 
mental learning, including bar pressing, 
trials are given after the fashion of the clas 


amples of instru- 
can be so arranged that discrete 
Sical conditioned response. For ex- 
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ample, it is a simple matter to withdraw the bar from the Skinner box and 
to present it again only when we wish to begin the next trial. 

Control over the rate at which responses occur is important to students 
of learning because Pavlov (1927) reported that a classical conditioned 
response extinguishes much more rapidly when trials occur close together 
than when they are spread apart. More recently experiments have been 
performed to see if this is also true of instrumental conditioning. 

First of all, it seems clear that performance of instrumental behavior 
during reinforcement is impaired when trials are massed together (Hovland, 
1936; Spence and Norris, 1950). Secondly, most studies show that massing 
trials during extinction has the effect of decreasing resistance to extinction 
Or of decreasing the tendency to respond on any given trial during extinction 
(Gagne, 1941; Reynolds, 1945a; Rohrer, 1947; Teichner, 1952). Other 
studies, however, show that under some conditions resistance to extinction 
can be greater with massed trials (Sheffield, 1950) and that under still other 
conditions the spacing of trials probably makes no difference (Porter, 
1939). 

Teichner (1952) points out the probable reason for these differences. 
He Suggests that, in general, acquisition is slower when trials are massed 
closely together and that extinction is faster under these conditions, but that 
these effects are relatively small and easily obscured by other conditions. 
For example, Teichner’s experimental results ona running response in rats 
Show that the effect of spacing trials during Extinction depends upon their 
Spacing during acquisition. The greatest resistance to extinction is found 
When the same spacing of trials is used for both; when EE : EE 5 
used, resistance to extinction decreases. Since Stanley (1952) shows experi- 
Menta] evidence for this same point, we can conclude thot TEI EH 
ance to extinction is produced by using the same distribution of trials in 
both extinction and conditioning and that slightly Brealey CTBT in EE 
ance to extinction occurs if the distribution of trials in extinction is faster, 
rather than sJower, than that in conditioning. se othston de Hoo 

Effort and extinction. The role that effort plays in ট্‌ tioHon, Gn ES 
tant for theoretical reasons. One of the major bone on Bn Breed রি 
has assigned the decrement in response to a HENSON % 


Continued FH - . 
responding. vides evi- 
A Well-known experiment by Mowrer and TONES ra These 
dence Suggesting that effort does influence Tt Ee box Daring 
k gS ন 

Investigators trained rats to press a lever for food in the 4 


in di ounts, so that the 
Acquisition the lever Was counterweighted in different am i a 
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rats learned to press the bar when it required 5 grams to activate the feeder, 
when it required 42.5 grams, and when it required 80 grams. At the end 
of acquisition the rats were divided into three groups, and each group was 
extinguished with a different counterweight attached to the lever. There 
was a simple negative relationship between resistance to extinction and the 
amount of effort required to push the bar. This general effect is confirmed 
in a study of Solomon’s (1948) in which the distance rats had to jump to 
reach a goal box was the variable producing effort in the response. 

These two experiments leave something to be desired, however. It is 
possible that the differences in resistance to extinction were due to varia- 
tions in the amount of training given the different responses. When careful 
controls are introduced into an experiment on effort, resistance to extinction 
appears to vary only a small amount for responses requiring different 
amounts of work (Maatsch, Adelman, and Denny, 1954). Furthermore, 
the differences that do occur may be the result of differences in operant 
level of responding for various conditions; that is, rats may “spontaneously” 
engage in easy responses more frequently than in hard ones. 

Thus, while it seems likely that work and time are variables in determin- 
ing the course of extinction, they are not the most important ones. The 
results achieved in well-controlled experiments suggest that other factors 
are much more significant in determining the behavior of individual animals. 
As we shall see presently, this conclusion has important implications for 
theories of extinction. 

Spontaneous recovery and disinhibition. Pavlov, in his studies of con- 
ditioned salivation, discovered two phenomena that he thought to be of 
considerable theoretical importance. They are called spontaneous recovery 
and disinhibition. Again, we link them together because they are important 
in the theory of extinction. In both Spontaneous recovery and disinhibition 
the major effect is a recovery of the conditioned response after a certain 
amount of extinction. Thus, a Strong argument can be made that extinction 
generates an active inhibition of the conditioned Tesponse and that this in- 


hibition can be removed by particular conditions. Let us examin 
neous recovery rather closely, 


of the two phenomena. 


্‌ € sponta- 
since it is the more important and pervasive 


Spontaneous recovery. An example from Pavlov’s laboratory will illus- 
trate the basic principle of spontaneous Tecovery. Pavlov (1927) produced 
a conditioned salivary response by allowing a dog first to see some meat 
powder and then to eat it. After the dog had learned to salivate at the sight 
of the meat powder, Pavlov extinguished the response by no longer putting 
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the meat powder into the animal’s mouth. It took only a few trials of this 
Sort to extinguish the conditioned response. The animal was then removed 
from the experimental room for approximately two hours. When the dog 
Was brought back and allowed to look at the meat powder again, the con- 
ditioned response returned. True, the amount of saliva flow was not so great 
this time as it had been before the extinction process was begun, but there 
Was no doubt that the conditioned response had recovered to a considerable 
extent—actually about one-sixth of the original amount. 

Another experiment (Ellson, 1938) illustrates the occurrence of sponta- 
neous recovery in an instrumental conditioned response. Four groups of rats 
Were trained to press a lever in the Skinner box. Following the conditioning 
Procedure, all animals were permitted to operate the lever without rein- 
forcement until they had failed to respond for a period of five minutes. One 
Eroup of rats was then removed from the box and allowed to rest for five 
minutes. These rats were replaced in the box and additional unreinforced 
responses were counted. The second group of rats resumed extinction after 
a pause of 25 minutes, the third group after 65 minutes, and the fourth 
after 185 minutes. All animals showed an immediate tendency to press the 
lever when replaced in the box, even though all of them had failed to press 
the lever for a period of five minutes prior to their removal from the box. 
Figure 10 shows that the amount of recovery, or the number of responses 
after rest, is an increasing function of the length of the rest period. Like 
most effects associated with the learning process, however, this function 
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Figure 10. Number of lever responses in a second extinction period following various intervals of 
rest. In this study most spontaneous recovery occurs within an hour after original extinction. (Data 


from Ellson, 1938.) 


52 THE PSYCHOLOGY OF LEARNING 


seems to have its limit, so that an extremely long rest period yields no more 
spontaneous recovery than a moderately long one. In this experiment the 
amount of recovery reached its limit at about one-fifth the original strength 
of the response. 

Spontaneous recovery is a genuine and stable part of the extinction process. 
We may, for example, try to stop our dog from begging at the dining-room 
table by ignoring it when it does so, and we may find by the end of the meal 
that we have been successful—the dog has retired to the living room. Never- 
theless, the next evening it will probably beg as before. Since spontaneous 
recovery is incomplete, however, the dog will not be so persistent this time 
and will stop sooner. Repeated extinction periods will finally eliminate the 
response. 

Disinhibition. The term disinhibition stems from Pavlov’s theory of 
extinction, and he placed great stress on the importance of this effect. In his 
Original demonstration of disinhibition, Pavlov established a conditioned 
response to the sight of meat powder. Once it was well established, he 
extinguished it by showing the meat to the dog without letting the animal 
eat it. When the conditioned response was completely extinguished, Pavlov 
paired a new stimulus with the conditioned stimulus. This stimulus was not 
one that could elicit the conditioned response in itself, but when it was 
paired with the extinguished conditioned stimulus it did. Thus, the salivary 
response to the sight of food, which had been completely extinguished, came 


back again when the new stimulus, vibration, was presented with the con- 
ditioned stimulus. 


THEORIES OF EXTINCTION 


Animals cease responding when they are no longer rewarded. So simple 
a fact would hardly seem to demand very subtle theoretic 


yet extinction has been one of the aspects of behavior in 
complicated theorizing. Actually, 


damental to any theory of behavi 
form unreinforced acts is just as 
forced acts in the first place. 

In the following section we shall stud 
theories of extinction. Then we shall try to evaluate the current status of 
the theoretical questions and examine Some of their weaknesses. 

Response-produced inhibition theory. The Tesponse-produced inhibi- 
tion theory is now the classical theory of extinction. Like “classical” theories 


al interpretation; 
volving the most 
Of course, a theory of extinction is fun- 
Or, for the fact that animals cease to per- 
important as the fact that they learn rein- 


y a few of the more important 
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in many other fields, however, it has been thoroughly demolished by critics 
and the rubble used to build new theories. Although the response-produced 
inhibition theory is no longer acceptable in its entirety, two of its ideas are 
Still of considerable importance. For this reason, and also because it pro- 
Vides a good case history of the fate of psychological theories, we shall 
look at it in some detail. 

The most complete statement of the response-produced inhibition theory 
comes from Hull (1943), though he took the basic concept of inhibition 
from Pavlov and borrowed other ideas from a variety of sources. 

Reactive inhibition. Hull started with the assumption that whenever an 
Organism makes a response it also generates some inhibition to that re- 
sponse. This means that, all other things being equal, once a response takes 
Place, it is somewhat less likely to occur in the immediate future. Such in- 
hibition may be thought of as analogous to fatigue in that it makes the next 
response more difficult. Like fatigue, this reactive inhibition disappears 
after a period of rest; if there is not enough rest between responses, how- 
ever, the inhibition accumulates from response to response. 

Hull (1943) advanced the hypothesis that the amount of inhibition that 
results from a series of responses is a positively accelerated function of the 
amount of work the response involves." In other words, as the response 
requires more effort, the inhibition accumulates at an increasing rate. 
Further, he specified, rest reduces the inhibition generated by each response 
in a way that is best described by a simple negative exponential function. 
Eventually, with enough rest, the response will completely recover its 
Strength. 

In this theory of Hull's, reactive inhibition will accumulate whether the 
response is reinforced or not. If the response is reinforced, the positive 
effects of reinforcement must overcome the negative effects of reactive in- 
hibition. Most of the time the positive effects will keep the animal respond- 
ing, but there is an interesting exception—a matter of experimental fact— 
Which lends plausibility to Hull's notion. If, during conditioning, trials are 
Massed very close together, the animals may slow their rates of responding 
and even stop altogether, even though the reinforcement is still present 
(Hovland, 1936; Calvin, et al., 1956). This effect has been called “inhibi- 


tion of reinforcement.” 


THull later (1951) considerably revised the substance of this assumption: he sug- 
Eested that for responses involving large amounts of work (relative to the ability of 
the animal involved) the inhibition generated was not much greater than that for 
Smaller amounts of work. In other words, the function which related responding and 


Work was S-shaped. 
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The most important aspect of the Tesponse-induced inhibition theory, 
however, concerns the results of extinction. When responses are no longer 
reinforced, reactive inhibition is thought to accumulate without being coun- 
teracted by any positive effects of reinforcement. The result is that the 
animal eventually ceases to respond. Since reactive inhibition disappears 
with time, we should predict Spontaneous recovery of the response after a 
rest. Remember, however, that in the example from Pavlov’s laboratory, 
Spontaneous recovery was only about one-sixth the Original response strength. 
At best, spontaneous Iecovery regains approximately 50 per cent of the 
Tesponse strength at the end of conditioning. Since 
incomplete, Hull postulated a second factor, condi. 
count for the permanent effects of extinction. 

Conditioned inhibition. Hull regarded reactive inhi 
motivational state. Presumably, animals are motivated to avoid reactive in- 
hibition as much as they are motivated to avoid an electric shock. If we 
accept the need-reduction theory of reinforcement, it is easy to see how 
escape from, or avoidance of, reactive inhibition would constitute a rein- 
forcement. Thus, when an animal ceases to respond, its fatigue state, or 
reactive inhibition, is reduced. This, in keeping with the need-reduction 
notion, is reinforcing. Reduction of reactive inhibition supposedly reinforces 


the animal for doing nothing, and the resulting failure to respond is called 
conditioned inhibition. 


spontaneous recovery is 
tioned inhibition, to ac- 


bition as a negative 
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trials are massed during extinction and when a greater amount of effort is 
required in performing the conditioned response. 

As we shall see, this two-factor theory of extinction (as it has been 
called) has been applied to a variety of problems in behavior—for example, 
to the fact that rats generally prefer not to repeat the same response on 
Successive trials in a maze. To take another important instance, it has been 
applied to the pronounced effects of massed practice and recovery with rest 
On the performance of complicated human skills, such as learning to type. 
Despite the widespread application of the theory, however, we now suspect 
that it contains some serious defects, and it is therefore important that we 
look at some of the criticisms of the theory. Since attempts to examine this 
theory experimentally have led to some valuable new information about 
behavior, the next section will serve as more than just a critical analysis of 
the response-produced inhibition theory. 

A critical analysis of the response-produced inhibition theory. The re- 
Sponse-produced inhibition theory, as stated by Hull, has been criticized on 
three grounds: (1) the theory is a good deal more ambiguous than it first 
appears (Koch, 1954); (2) it leads to contradictory deductions (Gleitman, 
Nachmias, and Neisser, 1954); and (3) the empirical evidence for it is 
Weak, and that against it, strong. 

Let us first look at the empirical evidence against the theory. This is per- 
haps the least critical objection to the theory, since almost any theory can 
be modified to fit a new fact. The facts stated below, however, are particu- 
larly awkward for the response-induced inhibition theory. 

Extinction without responding. The most important single idea in the 
Tesponse-produced inhibition theory is that decrement in responding is the 
Tesult of a negative feedback from responding itself. Thus, the more effort 
a given response requires, the greater the difficulty in producing an addi- 
tional response. Yet there is evidence that a decrement in responding can 
Occur even when no overt response causes it. There is the phenomenon of 
“subzero” extinction, reported by Pavlov (1927). Pavlov showed that if 
We extinguish a classical conditioned response to the point at which it com- 
Pletely disappears, and then continue to present the conditioned stimulus, 
these additional trials weaken the conditioned response even further (shown 
by Spontaneous-recovery tests). This means that presenting the conditioned 
stimulus alone was effective in producing further reduction of response 
Strength—it was not necessary for the animal to give the conditioned re- 
Sponse. Thus, response-produced inhibition could not have occurred. 
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It is also difficult for the response-induced inhibition theory to explain 
data derived from experiments on nonresponse extinction. The first example 
of such extinction comes from Seward and Levy (1949). They trained 
rats to run down a runway to secure food Placed in a goal box at the other 
end. After conditioning, one group of rats was placed directly in the goal 
box when it was empty of food and permitted to see that the reinforcement 
was no longer there. A second group of rats did not have the benefit of this 
preview. The next day, all the rats were given a series of extinction trials. 
The group that had been able to see beforehand that the food was no longer 
there extinguished approximately twice as quickly as the other rats. It was 
evident that the response had been reduced in strength by the sight of the 


empty box and therefore extinction took place without the running re- 
sponse. 


Several other experimenters have been able to find “extinction without 


; Moltz, 1955), and others have 
harlock, 1954); the effect may 
erstood. Those experiments that 
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missing extinguished more rapidly. 
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Spontaneous recovery from extinc- 
Ct the presence of inhibition induced by the 


EXTINCTION AND PARTIAL REINFORCEMENT 57 


The experiment by Moltz (1955) suggests that extinction without re- 
sponding is the result of the reduction of the secondary reinforcement value 
Of the cues from the reinforced side of the maze. This leads to the proposi- 
tion that extinction involves the reduction in the strength of secondary cues 
to “command” behavior when they are no longer associated with primary 
reinforcements. Hurwitz (1955) points out that, in the Skinner box, if one 
extinguishes early members of the response chain (approaching the lever, 
for example), one also extinguishes the later members (pressing the lever), 
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Figure 11. Learning and extinction of the correct choice in a T maze. “Response extinction indi- 
Cates tests for the number of correct choices made after the usual extinction procedure. “Nonre- 
SPonse” extinction indicates choices made after the animals had been placed directly in the empty 
Socal box. Running to the goal box does not seem necessary to produce extinction. There is sponta- 
neous recovery following “response extinction but little or none following “nonresponse” extinc- 


tion. (Deese, 1951.) 


and he suggests that this may be due to the weakening of the goal gradient 


Or gradient of reinforcement. 

Response inhibition and stimulus satiation. One of the facts that have 
tended to support the theory of response-produced inhibition is provided 
by studies which show that, given a free choice, animals prefer not to repeat 
Tesponses they have just performed. For example, if rats are not reinforced 
for going to either side of a single-choice T maze, they will tend to alternate 
Choices and not repeat their last choice. If response-produced inhibition 
Provides an explanation of alternation behavior, there are a number of de- 
ductions that can be made from the properties of response-produced inhibi- 
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tion. For example, decreasing the time interval between trials ought to 
increase the frequency of alternation. Likewise, increasing the effort in- 
volved in making a response should increase alternation. In another instance, 
forcing animals to go consistently to one side of a T maze for a while ought 
to make them tend to choose the opposite side. This tendency to choose the 
opposite should decline if an interval of time is allowed between the forced 
trials and the free-choice trials (reactive inhibition disappears with rest). 
In general, these deductions stand up to experimental test (Heathers, 1940; 
Zeaman and House, 1951; Riley and Shapiro, 1952). 

Despite this rather impressive array of evidence, investigators of animal 
behavior, other than learning theorists, have long held another explanation 
for the failure of responses with repeated elicitation. One investigator 
(Prachtl, 1953) studying the gaping response (the yawnlike opening of the 
mouth observed in young birds when the parent bird returns to the nest 
with food) showed that, while repeated elicitations of the response will 
exhaust it, changing the stimulus will make the response come back again. 
Actually, many experiments have shown the major cause of alternation and 


related phenomena to lie in stimulus conditions rather than i 


n the repetition 
Of responses per se. 


One simple experimental result seems strongly 
tion exclusively in terms of response-produced 
the stimuli associated with each pathway in a 
will tend to repeat rather than alternate their ch 


1953). This suggests that the rats £0 where the 
larly, 


against a theory of alterna- 
inhibition. If we exchange 
two-alternative maze, rats 
oice of pathways (Glanzer, 


re are “new” stimuli. Simi- 
another experiment shows that rats alternate places they go to more 


frequently than specific responses in getting there (Montgomery, 1952). 

A rather exhaustive experiment by Rothkoptf and Zeaman (1952) shows 
that the control of alternation in a T maze is not simple; they s 
both inhibition from response and such factors as novelty of exte 
lation control the tendency to alternate. Thus, while it is pr 
Tesponse-induced inhibition plays a part in such Phenomena as 


of responses, it is certainly not the only factor, and 
most important. 


The present status of the response-produced inhibition theory. Putting 
all the evidence together— evidence concerning the distribution of trials, the 
effects of effort on response tendency, alternation behavior, etc we must 
conclude that responding has an inhibitory effect upon future tendency to 
respond. Beyond this, however, we cannot £0, for it is equally evident that 
this feedback from responding is not necessary to extinction. Furthermore, 


uggest that 
Tnal stimu- 
obable that 
alternation 
Perhaps not even the 
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it appears that there are rather severe logical problems in applying the 
notion of response-induced inhibition to extinction itself. 

The most serious logical problem in the response-induced inhibition 
theory as it is stated by Hull is that it implies that all behavior must eventu- 
ally cease, whether it is reinforced or not (Gleitman, Nachmias, and Neisser, 
1954). For even though habit strength for any particular response will even- 
tually reach a maximal value, inhibition continues to be added for each 
response. As inhibition is added, so is conditioned inhibition, which thus 
continues to grow while habit strength remains constant. The net effect of 
this will be to make the response weaken and eventually disappear. It is 
true, as we saw above, that responses do sometimes weaken in strength 
even with continued reinforcement and, indeed, they may even eventually 


disappear (Calvin, et al., 1956), but this is the rare exception. 

The root of the matter is that the concept of conditioned inhibition is 
Weak. As originally stated in Hull’s theory of learning, it leads to contradic- 
tory properties. We have seen that extinction can occur when conditioned 
inhibition based upon response-produced inhibition should be very weak. 
Therefore, it would seem that the notion of conditioned inhibition is super- 
fluous. This would leave us with the idea that there is a temporary, fatigue- 
like decrement produced by continued responding. This inhibition may have 
Something to do with the organism temporarily ceasing to respond, but it 
cannot have much to do with the permanent effects of extinction. It is clear 
that, even if we do accept the general proposition of response-produced 
inhibition, we will need some concept that will help explain the permanent 


effects of extinction. 


CURRENT ALTERNATIVE THEORIES OF EXTINCTION 


Competition theory. The best-known alternative theory of extinction is 
that put forward by Guthrie (1935, 1952) and, to some extent, by Estes 
(1950). In a word, this theory attributes extinction of a particular response 
to the acquisition of competing response tendencies. Thus, when an animal 
Ceases to respond in a particular situation, it is because it has learned not 
to do so by an association between “not doing so” and the stimuli. In some 
Cases the animal will learn alternative responses and in others it will learn 
to do nothing. 

This leads Guthrie to say that there are three ways in which responses 
may be eliminated. The first method is to introduce the conditioned stimulus 
subliminally, so that it will not elicit a response, and then to increase its 
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strength gradually. This is what we do when we train a horse to saddle by 
first putting on only a light blanket and then gradually working up to full 
gear. The blanket is not a sufficient stimulus to set off bucking, and the 
horse is not disturbed by the gradual addition of heavier loads. An experi- 
ment by Kimble and Kendall (1953) shows that this method of “toleration” 
works quite well in a simple avoidance habit in rats. A second method is 
to repeat the conditioned stimulus until the original response is exhausted 
and the organism is too fatigued to respond. This is the bronco-busting tech- 
nique and, incidentally, the special case handled quite well by the condi- 
tioned-inhibition notion. The third method is to present the conditioned 
stimulus when it is mechanically impossible for the Organism to respond. 

Aside from providing this common-sense analysis of methods of eliminat- 
ing behavior, the theory of competition advanced by Estes has the merit of 
considerable logical elegance. For one thing, conditioning and extinction be- 
come simply different aspects of the same thing—they are both learning 
by a process of associating elements of particular responses with elements 
of stimulus complexes. Furthermore, as Estes (1955) points out, Sspon- 
taneous recovery under this theory is not a special Phenomenon of inhibi- 
tion, but results from the fact that samples of stimuli conditioned (or ex- 
tinguished) on any one trial may not be present on the next; thus the 
tendency to respond can “spontaneously” change between trials.2 

There are difficulties, unfortunately, with the competition theory, too. 
For one thing it becomes rather cumbersome when applied to the general 
cases of indirect extinction discussed under Extinction without Respond- 
ing. Furthermore, there are practically no simple relationships between the 
properties of conditioning and extinction, for a maximal rate of responding 
can sometimes be achieved in the Skinner box with only one reinforcement, 
while extinction after one reinforcement may require as many as 150 re- 
sponses before the rate of responding is returned to the unconditioned level 
(Skinner, 1938). In classical conditioning, extinction is usually more rapid 
than the original conditioning. 

These, however, are relatively minor Objections, not nearly so serious as 
those to the theory making use of the concept of response-induced inhibi- 
tion. There has been a serious effort, moreover, to examine some of the 
experimental implications of the theory (see, for example, Lauer and 
Estes, 1955). Perhaps the most serious Objection to the theory is not a 
logical one. That is, there is some reason for believing that extinction, in 
2 For an experimental analysis of the role of spont 


EE Aneous recovery in a formal com- 
petition theory, see Homme, 1956. 
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Part at least, reflects factors other than loss of habit. Let us examine these 
in the context of other theories. 

The frustration theory of extinction. Many experimenters have ob- 
Served that when an instrumental habit is extinguished animals tend to be- 
come emotionally excited, as if they were frustrated by the withdrawal of 
reinforcement. Skinner (1938), for example, has noted this in rats under- 
going extinction and has attributed to it the comparative irregularity of 
extinction curves. More specifically, some investigators have assumed that 
extinction gives rise to a frustration drive and that this accounts for some 
of the effects one finds in a study of extinction. 

One simple runway experiment (Adelman and Maatsch, 1955) shows that 
if rats are allowed to escape from the empty goal box during extinction, 
they do not show any loss for the originally reinforced habit of running 
down the runway. Of course, if these animals are confined in the goal box, 
they do show a loss of response strength. One interpretation of this is that 
escaping from the frustrating conditions of extinction does not produce the 
Same emotional aversion to the empty goal box as being confined in it. 

Another series of experiments (Lambert and Solomon, 1952; Lambert, 
Lambert, and Watson, 1953) show an effect that is difficult to interpret ex- 
cept by assuming that extinction gives rise to a state of frustration. These 
experiments show that the closer an organism gets to the reinforcement be- 
fore it is frustrated, the harder it is to extinguish a habit. In one of the ex- 
Periments this effect was discovered by blocking rats at various points in 
a simple runway; the farther the block was from the goal box, the more 
quickly the rats gave up running. In the other experiment, essentially the 
same effect was found with a token-reward habit in young children. 

The interpretation of these experiments is that frustration is greater when 
the organism is blocked just before reaching the goal box than when it is 
blocked farther away. This frustration heightens the animal’s drive and, 
hence, its activity, so that it interferes with either the acquisition of a new 
habit or “learning not to respond.” 

It is probably true that the failure of reinforcement for many instru- 
Mental acts does produce frustration, and that this has special effects on 
behavior, as the above experiments show (also, see Stanley and Rowe, 
1954). Frustration does not provide an explanation of extinction itself, 
however; it only suggests how the process of extinction can be modified by 
emotional activity. 

Extinction as loss of incentive value. Strangely enough, the hypothesis 
that is probably the simplest has received the least attention. This is the no- 
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tion that extinction has little or nothing to do with habit or learning, but, 
rather, produces a reduction in response strength by changing the value of 
the incentive (Johdai, 1955). The effects of extinction, according to this 
argument, can be considered simply as the limiting case of the reduction in 
size of reinforcement after learning has taken place (Dufort and Kimble, 
1956). Given the fact that behavior varies according to the size of rein- 
forcement, it seems a theoretical extravagance, as Dufort and Kimble point 
out, to assume that completely different processes take place in extinction— 
reduction of reinforcement to zero after previous reinforcement. 

In the last chapter we discussed another special case of the absence of re- 
inforcement under latent learning. In the experiments on latent learning we 
saw that animals could learn something about a maze even though they 
were not specifically reinforced in the goal box. The fact that the animals 
had learned the maze, however, was not evident until some incentive was 
given them. In the same way, if we extinguish a maze habit by withdrawing 
reinforcement after the maze has been learned, the fact that animals will 
now make more errors (see Tolman and Honzik, 1930) does not mean 
that they have unlearned the correct path to the goal box. Rather, the rats 
appear to have no interest in getting to the goal box. There are, of course, 
many implications in considering extinction to be a special case of zero 
amount of reinforcement. Many of these implications were discussed in the 
previous chapter under amount of reinforcement. 

The present state of theories about extinction. We have seen that there 
is a temporary negative feedback effect on responding resulting from pre- 
vious responses. This effect provides the basis for the concept of response- 
Produced, or reactive, inhibition. Reactive inhibition disappears with rest, 
however, and except for special cases, it is doubtful whether it has much to 
do with the more permanent effects of extinction. In addition there is an 
analogous phenomenon in stimulation (Hinde, 1954; Glanzer, 1951); that 
is, all other things being equal, animals tend to avoid stimuli they have just 
been subjected to. It is likely that this “stimulus satiation, 


hibition, is largely temporary in nature, 
thoroughly explored. 


” like reactive in- 
though this issue has not been 


Furthermore, responses sometimes disappear because Organisms learn 
new responses that displace the old ones. In classical conditioning this may 


mean that the organism acquires a new response to stimu 


li already condi- 
tioned to another, y 


incompatible response. In instrumental conditioning it 
may mean that the organism learns a new method for receiving the same 
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reinforcement or that it learns that a different set of stimuli is not corre- 
lated with the occurrence of reinforcement. 

There is one important question about this matter of extinguishing an old 
response by learning competing activity. It is not clearly established that 
learning the new response necessarily results in “unlearning” the old one. 
The theories advanced by Guthrie and Estes suggest that this is so, but 
there are good grounds for assuming that it is not always true. It is clear, at 
least, that learning a new incompatible response does not always reduce the 
Old response to its original, unlearned strength. Those theories that reduce 
all conditioning to the paradigm of classical conditioning would suggest that 
this is so simply because all of the cues that were conditioned to the old 
response are not conditioned to the new one, sO that stimuli conditioned to 
the old response are left over. 

At any rate, one of the things that should soon be settled is whether com- 
peting responses always result in the unlearning of the originally established 
response or simply have a temporary effect on performance. If the latter is 


true, when the competing response disappears, the original response should 


spontaneously recover in strength. There is some experimental evidence 


that the latter case is true at least in some situations (Whiting and Mowrer, 
1943). This is a complicated question, and we shall see it again when we 
consider discrimination and conflict in animal behavior, and transfer in hu- 
man verbal behavior. 

In addition to the factor of competition, it is possible that a large number 
of the examples of extinction studied in the laboratory (as well as many 
Problems in animal behavior in nature) are the result simply of the change 
in incentive or reinforcement. After our discussion in the last chapter, we 
Should suspect that such examples of extinction primarily involve changes 
in what animals are willing to do rather than in what they learn. Thus, many 
Teductions in strength of response by nonreinforcement involve reduction in 
the momentary tendency to respond rather than any baie :hAbIt DAHLE - 


PARTIAL REINFORCEMENT 


We have seen that learned behavior is maintained in high strength by 
cinforced, it declines in strength. This 


reinforcement, and that if it is not rr 
effect of reinforcement determines what animals learn, since they cannot 
have some experience; the occur- 


learn in situations in which they do not £ ; 
rence of reinforcement determines the kinds of experiences organisms are 
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going to have. Thus far we have considered only two particular cases of re- 
inforcement, one in which occurrence of a response is reinforced, and the 
other in which there is no reinforcement at all. It is obvious, however, that 
there may be intermediate cases. We could, for example, reinforce every 
other occurrence of a response. In general, any case in which we reinforce 
behavior some, but not all, of the time is an example of partial reinforce- 
ment. The schedule of reinforcement is an extremely important variable in 


the control of behavior, and it will be worthwhile to examine this problem 
in some detail. 


SCHEDULES OF REINFORCEMENT 


In general, there are two ways we can control the patterning of reinforce- 


ments and responses. In the first case we can Vary the time between rein- 


forcements irrespective of the number of responses between reinforcements. 
Thus, we may arbitrarily decide to reinforce once every five minutes, re- 
gardless of whether the animal responds one hundred times or only once 
during the five minutes. The other way in which we can change the pattern 


of reinforcements is to Vary the frequency of responses between reinforce- 
ments. In this case, we can decide to reinforce every tenth response, whether 
10 responses take five minutes or one. I 


n using these two methods, we can 
alter the reinforcement schedule so th 


’ > 


ally every minute or every three minutes. 


eduling reinforcements have somewhat dif- 
ferent effects, let us now look at them briefly. 


Fixed-interval reinforcement. Fixed-interval, Or periodic, reinforce- 
ment occurs on a fixed time schedule. This kind of reinforcement has a par- 


behavior. When an ani- 


, its behavior becomes stable 
will emit a constant number of responses per reinforcement. If, for exam- 
ple, the animal Lives an average of 20 responses for each reinforcement and 
it is being reinforced once a minute, its rate of responding will be 20 re- 
sponses per minute. If, however, the animal is reinforced Once every two 
minutes, these 20 responses will be spread out Over that period, so that its 


y minute. The general rule is that the 
rate of responding is inversely proporti 


+ In general, the animal 
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Variable-interval reinforcement. The other extreme of the interval 
schedule is to administer the reinforcement at random times. Thus, after an 
animal has learned a particular response, we might reinforce it the first time 
after one minute, then after thirty seconds, then after three minutes, etc. 
Under these conditions behavior becomes extremely stable, the animal re- 
sponding with almost machinelike precision. 

Fixed-ratio reinforcement. In fixed-ratio reinforcement the reinforce- 
ment is regularly delivered after a specific number of responses. Here, as we 
might expect, the effects on performance are dramatically different from 
those of fixed-interval reinforcement. First of all, reinforcement on a ratio 
schedule places a premium on responding rapidly, for the higher the rate of 
responding, the higher the rate of reinforcement. If an animal is being rein- 
forced once every 10 responses, it can get a reinforcement every 10 seconds 
by responding once per second. Even more characteristic of fixed-ratio rein- 
forcement is the fact that the rate of responding is variable. Immediately 
after a reinforcement it is low, and it gradually increases until the next rein- 
forcement occurs. Since this acceleration occurs as the ratio of reinforce- 
ments to responses gets low, unlike interval schedules, fixed-ratio schedules 
Produce higher average rates of responding with fewer reinforcements. 

Variable-ratio reinforcement. The variable-ratio reinforcement case is 
Perhaps the most interesting, because it is probably more characteristic of 
the conditions in which natural behavior occurs. Variable-ratio reinforce- 
ment is administered after different numbers of responses. If we reinforce a 
Well-learned habit on a variable-ratio schedule, we do not get the increase 
in rate of response between reinforcements obtained with fixed-ratio rein- 
forcement since the animal cannot predict the exact number of responses 
necessary for a reinforcement; there is a premium on responding rapidly, 
for getting a reinforcement eventually depends upon the number of tries. 

The effects associated with particular ways of administering reinforce- 
Ment are all effects on performance—they are things that will happen to 
A well-learned response when a partial reinforcement schedule is introduced. 
Let us now look at the effects of partial reinforcement upon the specific 


Conditions of acquisition and extinction. 


EFFECTS OF PARTIAL REINFORCEMENT ON 
LEARNING AND EXTINCTION 


The effects of partial reinforcement on acquisition. If we tried to train 
a Tat to press a lever in the Skinner box by reinforcing it once for every 10 
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responses, we would have a long wait before the animal learned the es 
sponse. In general this would be true in any situation in which the major 
thing to be learned is the relationship between a response and its conse- 
quence—the reinforcement. Since learning such a relationship is the essence 
of most studies on simple instrumental conditioning, it is not surprising that 
a review of the literature on partial reinforcement (Jenkins and Stanley, 
1950) shows that most studies reveal a tendency for anything less than 100 
per cent reinforcement to slow the rate of acquisition. In other words, rein- 
forcing simple instrumental acts part of the time rather than all of the time 
appears to slow down the rate of learning. 

This retarding effect of partial reinforcement is probably the indirect ef- 
fect of the many conditions that determine an animal’s performance in a 
given situation. We saw in the last chapter that such factors as delay of re- 


inforcement, amount of reinforcement, etc., work primarily on performance 


rather than on rate of learning; any effect upon rate of learning of these 


reinforcement variables is indirect, determined by the fact that performance 
is frequently a major determiner of what animals learn. Much the same 
thing is true of the schedule of reinforcement; any effect it has upon learn- 
ing is probably achieved through an alteration in the nature of the instru- 
mental act or of the stimulus-response sequence that the animal must learn. 

The effect of partial reinforcement on resistance to extinction. The 
most dramatic effect of partial reinforcement is On resistance to extinction. 
All schedules of reinforcement affect resistance to extinction somewhat dif- 


ferently, but it is a general rule that Partial reinforcement will nearly always 
Produce greater resistance to extinction than 100 per cent reinforcement 
(see Jenkins and Stanley, 1950). 

Earlier in this chapter we saw th: 
tivity had been reinforced determi 
activity would be performed in the 


at the number of times a particular ac- 
ned, up to a limit, how many times the 

absence of reinforcement. The maximal 
resistance to extinction after a 100 per cent reinforcement schedule is not 
great. No matter how many times we reinforce lever pressing in the rat, it 
is unlikely that the animal will res 


[ pond much more than 200 times without 
reinforcement. Thus, habits estab 


i lished and maintained with 100 per cent 
reinforcement are not Very resistant to extinction. 


Humphreys (1939) on a classical conditioned response showed that a sched- 
ule of partial reinforcement greatly in: 


creased resistance to extinction. Ex- 
cept for very special conditions, nea 


ve rly every experiment since then has 
shown similar results. Not only does partial reinforcement make a habit 


EXTINCTION AND PARTIAL REINFORCEMENT 67 


more resistant to extinction, but one experiment on maze behavior in the 
rat shows that such reinforcement makes it more difficult to learn a new 
conflicting habit (Wike, 1953). The large effects of partial reinforcement 
On extinction may be seen in Figure 12 (Jenkins, McFann, and Clayton, 
1950). 

This particular effect of partial reinforcement is of the greatest theoretical 
and practical importance. For one thing, it enables us to understand why 
behavior in natural settings is likely to be so persistent—most behavior is 
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Figure 12. Cumulative extinction curves after partial and continuous reinforcement. For the par- 
e the animals were placed on a variable-interval schedule. The animals 


Ea Ertan condlion, McFann, and Clayton 
s le 


in both groups received 200 reinforcements before extinction began. (Jenkins, 
1950.) 


Probably not reinforced for each try. We also know that the most efficient 
Use of reinforcement is obtained by a partial reinforcement schedule (for 
most purposes, a variable-ratio one), probably after an initial group of 100 
Per cent reinforced responses. In this way we TNE the strength of rein- 
forcements that are likely to suffer from satiation effects (such as new toys 
do for an infant), and, at the same time, we produce the largest resistance 
to extinction. | fl 
There is apparently no limit to the extent to which a schedule of very in- 
frequent reinforcement can increase resistance to EEL DELON Even in the 
laboratory it is possible to produce habits that will resist the extinguishing 
effects of thousands of unreinforced responses. As a matter of fact, Skinner 


68 THE PSYCHOLOGY OF LEARNING 


(1950) suggests that if conditions are proper it is possible to produce an 
extinction curve with no curvature; in other words, the animal goes on re- 
sponding without reinforcement ad infinitum. Incidentally, Skinner remarks, 
Such a result is a telling blow against a theory of extinction built upon the 
accumulation of a response-produced inhibition. 

Some theoretical importance was Originally attached to the possibility 
that wider spacing of trials might have the effect of reducing the heightened 
resistance to extinction brought about by partial reinforcement (Sheffield, 
1949). Subsequent experimental work, however, has established that such 
a reversal does not always occur under spaced practice, and indeed proba- 


bly does so only under very special circumstances (Grant, Schipper, and 
Ross, 1952; Weinstock, 1954). 


INTERPRETATIONS OF PARTIAL REINFORCEMENT 


We have seen that the importance of 
First of all, a comparison of different SC 
that momentary characteristics of respon 
ticular schedule of reinforcement under 
Secondly, we saw that partial reinforcen 
creasing the resistance to extinction. Thi 


point, and it is the one that has received the most attention from theorists 
and been applied most to theories of Personality and development. Part of 


the interest in the effects of Partial reinforcement has centered in efforts to 
explain why partial reinforcemen 


Theories about these effects of i 
classes— stimulus theories and 


partial reinforcement is twofold. 
hedules of reinforcement showed 
Se strength depend upon the par- 
which an organism is performing. 
nent generally has the effect of in- 
S is perhaps the most fundamental 


response theories. 


Stimulus Theories 


Some of these. 
Expectancy. 
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When reinforcement stops altogether, this animal is likely to be more per- 
sistent. 

This concept of expectancy has been roundly criticized on several counts. 
Some critics have claimed that it is an anthropomorphic notion. Others 
have pointed out that the definition of expectancy is vague and ambiguous. 
Certainly, the criticism of anthropomorphism is justified. Human learners 
are capable of expectancy—as a form of verbal behavior. We can tell our- 
selves when we ought to expect something to happen. It is doubtful that 
animals are capable of expectancy in this sense. 

The most serious criticism of the notion of expectancy is that it is an 
ad hoc explanation. It is acceptable enough in itself if, for example, it is 
defined as the ability to respond to a conditioned cue before the uncondi- 
tioned stimulus appears. When the buzzer sounds the dog salivates; then 
the meat powder is placed in its mouth. If this is all one means by expect- 
ancy, there is no quarrel. This definition of expectancy (and it is nothing 
More than a definition) does not provide us, however, with a very powerful 
explanatory tool. We may apply it to the problem of partial reinforcement, 


and by making some further incidental definitions, come out with the pre- 


diction that partial reinforcement will lead to greater resistance to extinc- 


tion than 100 per cent reinforcement. | | 
We have dealt rather harshly with this idea of expectancy. Later in this 


book we shall examine questions about decision making in human beings 
in which the notion of expectancy has been considerably more useful than 
it has been in the theory of simple instrumental and classical conditioned 


Tesponses. 


Discrimination. Anot ন | 
Partial reinforcement studies states that under partial reinforcement sched- 


ules it is less easy for the animal to discriminate between conditions of re- 
inforcement and those of extinction. This implies that the major determiner 
Of the rate (and probably the limit) of responding during extinction is the 
extent to which the situation during extinction is the same as that during 
Original reinforced training. In partial reinforcement there Bre Occasions 
When responses go unreinforced; consequently, when extinction is eventu- 
ally begun it is more difficult for the animal to discriminate between the 
Conditions of reinforcement and those of extinction. OO 

It would seem that this discrimination notion is quite similar toa general 
Concept of expectancy. The difference is that the discriminability hypothesis 
1S Stated in such a way as to preclude our attributing symbolic, human proc- 
Ssses to rats as they generate extinction curves. The word expectancy has 


her simple idea used to account for the results of 
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too much excess meaning for most people. Perhaps it is the better part of 
scientific caution to accept the discriminability notion as part of the ex- 
planation for the partial reinforcement effect and to dismiss the concept of 
expectancy. 

The notion of discriminability has other advantages, as we shall see later. 
The principal of these is that we know some of the rules for discrimination 
learning (see Chapter 4) and can use them to predict the specific effects of 
particular patterns of partial reinforcement. At any rate we can accept the 
notion that the increased resistance to extinction resulting from partial rein- 
forcement can be considered as partly due to a change in the discrimina- 
bility between the conditions of reinforcement and those of extinction. 


Response Theories 


Another way to look at the question of what produces the increased re- 
sistance to extinction with partial reinforcement is to consider the definition 
of the response unit. For most purposes we define the unit of response as 
that which will produce a reinforcement. Thus, in the Skinner box, it is a 
lever push capable of operating the feeder. It does not make any difference 
how the lever is pressed, as long as it is Pressed hard enough. By the same 
token we might argue that in any partial reinforcement situation the unit 
response is defined by the number of lever Presses necessary to activate the 
feeder. If we did this, we should consider all of the unreinforced lever 
pushes between reinforced Ones to be part of the activity necessary for rein- 
forcement and count them as only one response. Thus, if on a variable-ratio 
reinforcement schedule we reinforced 1 lever press in 10, we would count 
10 lever presses as only 1 response. On this variable-ratio reinforcement 
schedule 10 lever presses would be equivalent to 1 on a 100 per cent rein- 
forcement schedule. 

This argument was advanced by Mowrer and Jones (1945), 
showed that when responses are counted by this scheme, actually 
Sponses occur during extinction after partial reinforcement, ft 
course, more single bar presses occur. The Treason that fewer res 
Cur, argued Mowrer and Jones, is that a Stronger response. 
tion is induced by the greater activity required under partia 
It would have been interesting to see if this were also 
greater spontaneous recovery, 
question. 


and they 
fewer re- 
hough, of 
ponses o0c- 
-induced inhibi- 
1 reinforcement. 


accompanied by 
but Mowrer and Jones did not examine this 


While we have presented this notion as an effect upon responding and the 
hypotheses of expectancy and discriminability as effects upon stimuli, this 
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is, in the last analysis, largely a matter of convenience. Furthermore, there 
is nothing basically incompatible between these notions as they stand. It is 
possible that the unit of response as far as the organism is concerned is de- 
termined by what it “expects” or by the occurrence of a reinforcement. At 
any rate, while a more adequate explanation of the effects Of partial rein- 
forcement upon extinction may eventually be stated, it is probable that it 
will be close to those discussed above. 

Importance of partial reinforcement. The effects of partial reinforce- 
ment— particularly on extinction—are extremely important when applying 
the principles of learning and behavior theory to the understanding of be- 
havior under natural conditions. Most habits seem to be highly resistant to 
the effects of extinction, and this is very likely because behavior in natural 
life is reinforced on a probable rather than a certain basis. Thus, the young- 
ster who is beginning to get a command of language is likely to be persistent 
about asking for cookies, ice cream, etc., because, in the course of things, he 
may be reinforced on a variable-ratio schedule. Sometimes these variable- 
ratio reinforcements produce behavior which is highly resistant to the effects 
of extinction. Parents who are concerned about controlling the behavior of 
their offspring will sometimes try to introduce fixed-interval reinforcement. 
Thus, some children are supposed to be able to buy ice cream on Mondays 
and Thursdays but not on Tuesdays, Wednesdays, and Fridays. Since par- 
ents are not nearly so consistent as the schedules of reinforcements set up 
for rats in Skinner boxes, the discriminations based on fixed-interval rein- 
forcement programs that appear in laboratory rats do not always appear to 
be clear-cut in children. 

Thus, inconsistency in the application of reinforcement is the stuff of 
Which persistent behavior is made. How much an inconsistent application 
Of reinforcements (and punishments) is responsible for a generalized trait 
Not very much, it would seem, for even 


Of persistence we do not know. y : 
though parents differ enormously in the degree to which reinforcement and 


Punishment are consistently applied, the rest of the world is likely to be 
much the same in this respect. Each youngster, at least in the easygoing at- 
mosphere of American culture, is likely to have been exposed to a fairly 
Consistent pattern of inconsistency in the application of reinforcement. 
Partial reinforcement in choice and decision. A problem that has re- 
ceived considerable attention recently is that of finding out how irregular 
Patterns of reinforcement affect decisions. More generally, this has been 
Conceived of as a problem in the probabilistic nature of events that influence 
human and animal behavior. Brunswik (1954) for many years insisted that 
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the major variables in psychology were to be considered as only 4 
tic, not certain, in nature. For instance, a cue for visual depth, or ize; 0 
retinal image, does not always yield unequivocally SOIT information. 
Objects that loom large in our visual field are probably objects close to 5 
in space, but not necessarily so. So it is with reinforcements; they occur in 
a probabilistic way, seldom in an all-or-nothing way. . 

With this general view in mind, Brunswik (1939) reported an experi- 
ment on rats in a T maze in which most of the time reinforcements occurred 
on the right side but sometimes occurred on the left. Under these condi- 
tions, rats came to distribute their choices according to the probability of 
reinforcement. Thus, if the right side were reinforced 75 per cent of the 
time, and the left side only 25 per cent of the time, the rats would choose 
to go to the right side 75 per cent of the time. 

Of great current interest in the theory of behavior is what determines the 
likelihood of an organism making a particular choice given the probability 
but not the certainty of a particular event happening. Most experimental 
research has been directed toward verbal choice in adult human beings. In 
a typical experimental situation, a subject is presented with a panel on which 
are two lights and two buttons. He is to predict by pressing a button which 
of the lights is going to flash on a particular trial. The experimenter has ar- 
ranged a particular sequence of order for flashes on the right or left. As 
with Brunswik’s rats in the T maze, human beings apparently choose the 
alternatives in proportion to the frequency of the to-be-predicted event 
happening (Grant, Hake, and Hornseth, 1951; Estes and Straughan, 1954, 
etc.). This is a puzzling situation, for it is not the optimal strategy. We are 
not surprised, perhaps, that rats do not maximize the probability of receiv- 
ing a reinforcement, but we are Perhaps a little surprised that college stu- 
dents do not maximize the probability of being right (given a situation in 
which the frequency of alternative events is not even the best strategy is 


always to bet on the more frequently Occurring event, if we have no further 
information). 


It has been argued that this is a conse 


quence of the learning mechanisms 
(Estes and Straugh 


an, 1954; Bush and Mosteller, 1955), but if this is so 
itis an oversimplification. Hake (1955) has reviewed much of the literature 
On this topic and concluded that the conditions d 
a human subject making a given choice between two alternatives are influ- 
enced by several variables. There are reasons for believing, he points out, 
that in any such choice the subject is as much influenced by short sequences 
Of events just before a particular choice as he is by the over-all frequency 


etermining probability of 
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of past events. Also, to some extent, subjects are influenced by the nature of 
their previous choices. Hake points out that the matching of objective 
probability is a result; it does not necessarily describe what individuals do. 

Is this true of Brunswik’s rats? It is perhaps too early to arrive at a con- 
clusion. There are superficial parallels between the behavior of nonverbal 
subjects such as rats, pigeons, and young children under partial reinforce- 
ment, and the behavior of human adults in choice games with varying prob- 
abilities of making the right guess. Beyond this we can say little; we do not 
know if the parallels are anything more than superficial, and it is too early 
even to guess. We do know, however, that these experiments on human 
choice behavior that resemble certain experiments on partial reinforcement 
in animals so much are looking into important questions, and may eventu- 
ally have serious implications for all matters that involve human beings 


making decisions or choices. 


CHAPTER 4 


DISCRIMINATION AND LEARNING 


The chief function of learning for most animals in the world is to enable 
them to emit the right response at the right time. This generally means per- 
forming a response, on signal, that is already a part of the basic behavior 
of the animal. Most organisms seldom if ever find it necessary to learn 
skilled acts; much of the behavior (considered purely as response) of ani- 
mals is unlearned. They must frequently learn, however, under what cir- 
cumstances certain behavior is appropriate. It is the circumstances of emis- 
sion rather than the behavior per se that require learning in most animals. 
‘To be sure, in the laboratory we sometimes require rats to perform skilled 
acts, such as dropping marbles down a narrow chute, but this kind of thing 
is usually highly unnatural. More typically we take advantage of something 
a rat does more readily, such as exploring a narrow alley. Consequently a 
Sood deal of the learning a rat does in the laboratory involves simply the 
discrimination of stimuli in the environment and attaching appropriate re- 
sponses to these stimuli. Some Psychologists have said that in this sense all 
learning is discrimination. : 

Much of innate behavior, particularly in the lower organisms, is highly 
discriminative; some instinctive act is likely to be emitted only in the pres- 
ence of a rather precisely defined (unconditioned) stimulus. This has al- 
Ways been more or less understood, but in recent years we have greatly ex- 
tended our knowledge of the precision with which particular stimuli set off 
patterns of innate action. For example, in the three-spined Stickleback fish, 
a very crude model of a female will elicit the innate pattern of mating be- 
havior in the male, providing the model is Painted red on its underneath 
surface (Tinbergen, 1951). Here the critical feature of the stimulus seems 
to be the red color. 

In this chapter we shall look at the way in which animals learn to dis- 

74 


DISCRIMINATION AND LEARNING 75 


criminate. Since our main purpose is to understand the nature of learning 
— particularly as it applies to the human animal—we shall ignore some 
fascinating accounts of unusual discriminations in nature in favor of a de- 
tailed treatment of the process of learning to discriminate. We should note 
that this is not a process that is limited to learned stimuli; some uncondi- 
tioned stimuli have their effects modified through discriminative training 
(see Tinbergen, 1953). 

Before we look at discrimination itself, however, let us take a brief look 
at its opposite, generalization. Here we ask, How much variation can we in- 
troduce into a response-producing stimulus before the response disappears? 


GENERALIZATION 


While investigating the conditioned salivary response in dogs, Pavlov 
soon discovered that the conditioned response was not limited to the stim- 
ulus originally conditioned. It would spread to other, related stimuli. Ex- 
periments from Pavlov’s laboratory (Pavlov, 1927) established the fact that 
if tactile stimulation of a small area of a dog’s skin is used as a conditioned 
Stimulus, stimulation of other areas of the skin will also elicit the condi- 
tioned response, though the effect diminishes with the distance from the 
Original conditioned stimulus. This important effect is what is now known 
as stimulus generalization.t The two important points about stimulus gen- 


eralization are (1) that the effectiveness of conditioning is not limited to 


the stimulus originally conditioned, and (2) that the ability of a stimulus to 


elicit a conditioned response decreases with distance from the original con- 
ditioned stimulus. Since both of these points are basic to the psychology of 
learning, let us look at some experimental evidence about them. 

of classical conditioned responses. The experiments 
periments on the generalization 
however, look at some experi- 


Generalization 
from Pavlov’s laboratory are, of course, ex 
Of classical conditioned responses. Let Us, 
ments from laboratories that have used other classical conditioned responses 
and have more carefully controlled the quantitative relations between 


Stimuli. 
In one experiment, the investigators conditioned the galvanic skin re- 


Sponse 2 in man (elicited by an electric shock) to a vibrator applied near the 


1 “Trradiation” and “induction” are some of the other terms that have been used 


to describe stimulus generalization. 

* The galvanic skin response (GS. 
When they are excited. These glands are 
Of the autonomic nervous system; hence thi 


R) is the electrical activity of the sweat gland cells 
largely innervated by the sympathetic branch 
is response is an “emotional” one. 
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shoulders (Bass and Hull, 1934). The strength of the conditioned response 
was then tested with various stimuli placed at increasing distances from the 
original conditioned stimulus. The results of this experiment are shown in 
Figure 13. The conditioned galvanic skin reaction was greatest when it was 
nearest the original conditioned stimulus, and diminished in size as the 
stimuli increased in distance from the original conditioning site. 
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Figure 13. Generalization of a conditioned 
is the conditioned stimulus used duri 
conditioned stimulus only in location. 


galvanic skin response to different stimuli. Stimulus 1 
ing training. All stimuli used for testing differed from the 
(Data from Bass and Hull, 1934.) 

There have been many ex 
tioned galvanic skin response 
visual (Hovland, 1937a, 193 
etc.). Most of these studies 
ing the validity of the princ 
doubt about that but they 
of the generalization gradier 
is that there has been a wi 


periments on the generalization of the condi- 
to a variety of stimuli—auditory, tactual, and 
7b; Littman, 1949; Grant and Schiller, 1953; 
Were not performed with the idea of establish- 
iple of generalization— there seems to be little 
Were attempts to establish precisely the nature 
1. The reason for all this experimental activity 
dely accepted notion that there must be some 
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basic mathematical law governing the amount of stimulus generalization 
(Hull, 1950). Therefore a large number of experiments have been performed 
in an effort to discover the nature of this law. The sad outcome of all of 
these experiments suggests that there is not any such law. Rather, empiri- 
cally determined gradients of generalization take a variety of forms and 
have many different slopes. Some of the possibilities for these are illus- 
trated in Figure 14. 

Actually there are good reasons why an exact mathematical law of stim- 
ulus generalization has not been found. Some of these are purely mechani- 
cal. For example, even for the case of the galvanic skin response, there are 
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Figure 14. Some theoretical possibilities for generalization curves. Each curve would be expressed 


by a different mathematical equation. 


many ways in which the response can be measured—as electrical resistance, 
as current flow, as electrical potential, and, if a-c circuits are used, as other 
quantities. Unfortunately there is no rational choice for the basic measure- 
ment that should be employed. This precludes a fundamental mathematical 
law of stimulus seneralization for even so restricted a case as the galvanic 
Skin response. When we try to generalize a mathematical law to other re- 
Sponses, salivation, for example, the situation becomes even more hopeless, 


for here we introduce other arbitrary ways of measuring. 


Some theorists maintain that if probability of response occurrence is 


Used as the sole psychologically meaningful measure of behavior, we can 
then establish a mathematical law of generalization. The difficulty is that 


there is no unequivocal and psychologically meaningful way of stating the 
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units on the sensory dimension along which generalization is supposed to 
occur. Bush and Mosteller (1951) do suggest an index of similarity between 
stimuli, but they are unable to provide any way of rationally relating this 
index to the physical characteristics, such as size, distance, and intensity, 
that we use in measuring stimuli. Thus they are unable to suggest an exact 
mathematical statement of the principle of stimulus generalization. It is 
possible that this problem will soon be solved, but at present we must be 
content with the qualitative principles that stimuli do generalize their con- 
ditioned effects and that they generalize roughly to the extent that other 
stimuli are related to or resemble them. 

Generalization of instrumental conditioned responses. Generalization 
in classical conditioning is a relatively straightforward matter, since in clas- 
sical conditioning some stimulus is always used to elicit the conditioned re- 
sponse. In instrumental conditioning, however, we do not always attach a 
response to some particular stimulus, though it is usually perfectly feasible 
to do so. We can, for example, train a rat to press a lever in the Skinner 
box only upon signal. Such training ordinarily involves some discriminative 
learning, and generalization which comes after discrimination learning is 
quite different from that which comes before. Consequently in most instru- 
mental learning we look at generalization only after some discriminative 


learning has gone on; in order, however, to make a more exact parallel with 


classical conditioning, let us examine a case of stimulus generalization in 


instrumental conditioning in which there has been little or no discrimina- 
tion training. 


In this study (Grice and Saltz, 1950) rats were trained to run down an 
alley and obtain food by Pushing back a small door. The door was embed- 


ded in the stimulus to which the rats were trained; it consisted of a white 


circle on a black background. The rats were trained to run to a stimulus of 
a certain size, say, 79 square centimeters, and then were tested with circles 
of different sizes (for example, 50 square centimeters). The test consisted 
in finding out how many times the rats would run during extinction (no 
reinforcement). 


The results of the experiment are shown in Figure 15. As we can See, 
well-defined gradients of generalization occurred. The more remote the size 
of the test stimulus from the training stimulus, the fewer the responses dur- 


ing extinction. Thus it is clear that if a reinforcement is associated with 2 


particular stimulus during instrumental training, some of that association 
will “spill over” to stimuli related to, but not the 


চর Er same as, the stimulus 
originally conditioned. 
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The nature of generalization. The existence of gradients of stimulus 
generalization both for classical and instrumental conditioned responses is 
not in doubt. There has been, however, considerable argument over the na- 
ture of these gradients. Lashley and Wade (1946), for example, have ar- 
gued that they simply represent absence of opportunities for discrimination. 
In other words, they argue, if animals are not allowed to compare stimuli 
simultaneously, they find it difficult to discriminate between closely related 
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Figure 15. Generalization gradients for an instrumental response. The two curves show the mean 
to various test stimuli. The solid curve shows generalization 


and the dotted curve shows the same thing on 
1950.) 


number of responses during extinction 
after training on the 20 square millimeter stimulus, 
the 79 square millimeter stimulus. (Grice and Saltz, 


does not represent a spill-over of the influence 


Of habit so much as it does the failure to discriminate along some sensory 
that animals can learn to dis- 


dimension. The fact is, as we shall see later, 
criminate stimuli even when they are presented successively rather than 
Simultaneously, so any failure of discrimination in experiments on general- 
ization more than likely represents a failure of learned discrimination, not 
a failure in sensory capacity for discrimination. 

It is probable that the particular generalization gradient exhibited by an 
Organism under test in the laboratory, even if no specific discriminative 


stimuli. Thus, generalization 
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training has been given, represents the effects of the history of an organism 
with sensory learning. Even a laboratory rat that has been caged Asan lng 
eventful animal colony all its life has had some opportunity for discrimina- 
tive learning, and this probably has some effect on laboratory tests. We a 
not know much about these effects, but ever since the appearance of Hebb’s 
book The Organization of Behavior (1949), learning theorists have become 
increasingly aware of their existence. 

Extinction also generalizes. We have seen that the effects of reinforce- 
ment generalize to stimuli other than those specifically used in training. As 
might be expected, the effects of extinction also generalize in this way. Let 
us illustrate this with an example from classical conditioning (Hovland, 
19374). Suppose we condition a galvanic skin response in human subjects 
by pairing an electric shock with a tone Of a certain frequency. After con- 
ditioning we shall, of course, find that another frequency will elicit the re- 
sponse. Suppose we then extinguish the conditioned response to the second 
tone. If we then go back and test with the first tone, we shall find that ex- 
tinction of the response to the second tone also reduced the tendency to re- 
spond to the first. Thus the effects of extinction generalize as do the effects 
of reinforcement. 


The generalization of the effects of both reinforcement and extinction 
are extremely important in the theory of discrimin 
large portion of this chapter, as well as later chapters, will be devoted to the 
application of the Principles of stimulus generalization to an understanding 


of discriminative learning and of related questions in the psychology of 
learning. 


ative learning, and a 


DISCRIMINATIVE LEARNING 


Learning to make discrimina 
breaking down Eeneralizations. 


There are a Variety of speci 
learning in laboratory animals 
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doors, and these carry the stimuli. The problem illustrated in Figure 16 re- 
quires the rat to discriminate between vertical and horizontal stripes. If the 
rat makes the correct choice, the door falls and the rat is allowed to eat 
food placed inside the stand. If it makes the wrong choice, the door does 
not move and the rat falls to the net below. Thus, typically, the Lashley jump- 
ing stand makes use of punishment. 

In other types of discriminative studies single choice-point mazes and 


Figure 16. The Lashley jumping stand. The rat jumps from the stand to one door or the other. If 
the choice is correct, the stimulus card gives way; if the choice is wrong, the rat falls to the net 


below. (Lashley, 1935.) 


runways have been used. In order to study discrimination in animals not 
Ordinarily seen in the psychological laboratory, birds and reptiles, for ex- 
ample, special techniques must be devised. The fundamental rule remains 
the same however; animals are required to choose between a stimulus that 
is associated with reinforcement and one that is not (or is associated with 
Some unpleasant stimulus). 

The way in which animals learn such discriminations appears simple 
enough; with increasing frequency the correct alternative is chosen. If the 
Problem is easy the animal learns to make the correct choice in very few 
trials, and if it is difficult the animal may not discriminate at all. Usually 
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the difficulty of the problem is determined either by the closeness of the dis- 
crimination required or by the complexity of the stimuli presented. Despite 
the outward simplicity of the learning process, however, the events that go 
on in the organism must be very complicated. Let us look at some of the 
attempts to’ deal with this problem theoretically. 


Theories of Discriminative Learning 


Most current theories of discriminative learning fall into two classes. 
One of these places great emphasis upon the gradients of stimulus general- 
ization and upon the gradual accumulation of habit strength to the correct 
stimulus. The other class stresses the importance of the active problem- 
solving character of discrimination and the all-or-nothing characteristic of 
hypotheses that animals might employ in learning to discriminate. 

Unfortunately no single theorist of either class has dealt with all of the 
problems in discrimination learning. Therefore, in stating these theories, 
we must draw on the work of different theorists. By combining their views 
in this way, we probably do an injustice to the individual theorist. At the 
Present time, however, there seems to be no more satisfactory way of ob- 
taining a synthesis of these theoretical problems. 


The Algebraic Summation and Continuity Theory 


The theory of discriminative learning that makes greatest use of the con- 
cepts of stimulus generalization and of the gradual accumulation of habit 
strength is primarily the work of Spence (1936, 1937a, 1940, 1952), 
though most of his fundamental notions are close to those of Hull (1943, 
nd Let us look at some of the Principles these theorists have empha- 
sized. 

First Of all, these theorists have insisted that gradients of stimulus gen- 
eralization interact algebraically. Suppose that an animal is trained in 2 
discriminative problem by the reinforcement of response to one stimulus 
and extinction of response to another. The effects of reinforcement will 
generalize to all other related stimuli, and so will the effects of extinction. 
According to this view the strength of the tendency to respond is obtained 
by subtracting the strength of generalization of extinction from the strength 
of generalization of reinforcement for any particular stimulus. Thus the 
following equation expresses the notion: 

R= RI 
R, is the net strength of response after the inhib 


ition from extinction is sub- 
tracted. 
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This hypothesis is illustrated in Figure 17. Here, the solid curve repre- 
sents the generalization of conditioning R; the dashed curve, the generaliza- 
tion of extinction I; and the dotted one, the net response strength Re after I 
has been subtracted from R for every stimulus. The exact form of the gen- 
eralization curve, as well as the values of its constants, is, of course, un- 
known, but despite this limitation this idea of algebraic interaction leads to 


important conclusions. 


CONDITIONING 
+ 


EXTINCTION 


Er mT REE 


DEVIATIONS FROM STIMULUS REINFORCED 


Figure 17. Illustrates the algebraic-summation notion of discrimination learning. Net response 
strength is the result of the subtraction of the effects of generalized extinction from those of general- 


ized reinforcement. (Suggested from Hull, 1943.) 


Spence’s analysis of transposition. One of the most interesting by- 
Products of the algebraic summation notion is the way in which it can be 
Used to account for a special phenomenon in discriminative learning known 
As transposition. Transposition is the apparent ability of organisms to make 
relational discriminations. We can best illustrate what these are by example. 

Kohler (1915) once trained chickens to respond (with food as rein- 
forcement) to the darker of two gray surfaces. The chickens were never 
Teinforced when they responded to the lighter surface. When this discrimi- 
Nation was well established, the animals were presented with a new choice 
between the original reinforced gray and one darker still. The interesting 
thing was that the animals chose the darker of these two grays, even though 
they hag always been reinforced for choosing the other one. In other words, 
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argued Kohler, the animals had Jearned the relationship ee Fd 
rather than the association of the original stimulus with rein orcemt ত 

This result presents a paradox for the theory of learning, for HES i 
case in which reinforcement apparently has no effect (or even a negative 
effect) on choice. It turns out, however, that this example of OATS BORILNION 
can be explained by the principle of stimulus generalization. Let us turn 

nce’s explanation. | 

li - ক CU, 19376) uses an illustration based on a study of te as 
crimination of visual size in chimpanzees. He trained animals to discrimi- 
nate between two squares, 256 square centimeters and 160 square centi- 
meters in size. The response to 256 was reinforced and the response to 160 
was extinguished. Then, after the discrimination between stimuli 256 and 
160 was established, the values of the stimuli were shifted to 256 and 409. 
The animals chose 409, a stimulus which had never been reinforced. 

The theoretical analysis of this problem is presented in Figure 18. The 
extinction to stimulus 160 generalized to 256 to some extent; furthermore, 
the conditioning to stimulus 256 generalized somewhat to 409. To arrive 
at the comparative value of response strength after discrimination of 256 
and 409, we subtract the amount of generalized extinction from the amount 
of generalized conditioning. As shown in Figure 18, when the net response 
strength is obtained by this procedure, it is greater for 409 than for 256. 
Thus, we would predict that, on the average, animals would choose to Te 
spond to 409 rather than to 256. This is a prediction of the transposition 
effect. | 

The generalization curves in Figure 17 do not yield the transposition 
effect; in this figure the highest net response strength occurs to the stimulus 
Originally reinforced. From a comparison of this figure with Figure 18, we 
can infer that the prediction of whether or not transposition will occur de- 


pends upon the shape, relative location, and slope of the generalization 
curves. This latitude offered by the selection of arbitr 
ably to be determined experimentall 
retically, 


ary values (presum- 
Y) for curves of generalization is, theo- 
both an advantage and a weakness. It is a weakness in that the 
theory can provide no rational ® Way of exactly 


stating the quantitative as" 
pects of its assumptions; it is 


an advantage in that it suggests that trans- 
position will occur only sometimes. The gestalt notions from which the 


transposition problem originally comes imply that transposition will always 
Sg f 
3 Spence (1942) does present some empirical constants for a particular form © 


generalization, but since we have already seen something of the variability of obtain 
generalization curves, we cannot stretch his curves to fit many cases. 


DISCRIMINATION AND LEARNING 85 


occur. The theory based on generalization gradients suggests that occurrence 
depends upon how we affect the generalization gradients by training, etc. 
The obvious way to prevent transposition is to make the positive and 
negative stimuli very different from each other. A demonstration of what 
this would do can be seen by moving the curves in Figure 18 apart. In gen- 
eral, experimenters have found that when they move stimuli far apart, there 
is a failure of transposition (Spence, 1937a; Webb, 1950). Two studies 
(Kendler, 1950; Ehrenfreund, 1952) have systematically explored the 
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notion accounts for an example of “transposi- 


Figure 18. Illustrates how the algebraic-summation 
not at the stimulus size specifically reinforced 


tion behavior. The net response strength is greatest, 
(256), but at larger ones (409, etc.). (After Spence, 19370.) 


effects of increasing the difference between stimuli, and the results are what 
Would be expected from generalization gradients—the percentage of re- 
Sponses showing transposition decreases. 

Several experimenters have studied the problem of transposition in chil- 
dren (Kuenne, 1946; Hunter, 1952; Alberts and Ehrenfreund, 1951). Ar- 
guments have been advanced to the effect that as children pass from pre- 


Verbal to verbal stages their solution of the transposition problem changes 
gradients to one more like adult for- 


Kuenne, 1946; Stevenson, Iscoe, and 
ts that it is so, but it is an in- 


from one based upon generalization 
mation of “relationship” concepts ( 
McConnell, 1955). No critical evidence exis 
teresting hypothesis, and we shall encounter it later. 
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Despite the considerable merits of the stimulus-generalization derivation 
of transposition, we should not lose sight of the fact that sensory experi- 
ments have conclusively demonstrated our perception of objects to be de 
termined by the context in which they are embedded (gray compared with 
black looks different from the same gray compared with white). In this 
sense, even discriminations based on the subtraction of generalization Era 
dients should be relational; i.e., the generalization gradients themselves will 
be modified by perceptual relationships between the stimuli. Finally, rats 
will transpose more readily when the stimuli are presented simultaneously 
(Baker and Lawrence, 1951). Furthermore, rats will unambiguously make 
relational choices when they are presented with stimuli that clearly imply 
relationships. For example, relational choices are more likely if the stimuli 
are of two brightness levels rather than one (Lawrence and DeRivera, 1954). 

Other uses of the algebraic summation principle in generalization. The 
basic notion used in the prediction of transposition from generalization 
gradients, namely, the algebraic summation of generalized habit tenden- 
cies, has many other applications. It has been generally assumed by those 
working in the psychology of learning that stimulus generalization is a 
basic component in the ability of organisms to transfer what they have 
learned from one situation to another, similar one. The relationships be- 
tween stimulus generalization and more general problems in the transfer 
Of training involve many difficulties, however, 
cussion of some of these. Part of the difficulty, 
specify the exact form of 


and we must postpone a dis- 
of course, is our inability to 


generalization gradients. It should be noted that 
Some progress is being made; Lawrence (1955), for example, has pointed 


Out certain important restrictions that must be incorporated into generaliza- 
tion gradients if they are to account for the existing data on discriminative 


learning. Lawrence suggests that only certain types of concave downward 
gradients are adequate (see Figure 14). 


The Noncontinuity Theory 


The theory of discriminative learning we have just looked at places great 


emphasis upon the gradual accumulation of habit strength and the algebraic 
summation of gradients of generalization based on reinforcement and ex- 
tinction. In general this theory has been contrasted with one which empha- 
sizes the problem-solving behavior of Organisms in discriminative learning 
This is generally called a noncontinuity theory, in that it implies that learn- 
ing a discrimination is not a continuous accumulation of positive and nega- 
tive habit strength. The essential idea is that animals try out hypotheses 
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about the discriminative problem they are required to solve. Let us see 
how this might work. 

A rat is trained to jump to one or another window in the Lashley jump- 
ing stand. The problem for the rat is to learn to jump to the white window 
and to avoid the black one. When initially faced with a choice between the 
white and black, the white window might by chance be on the right side. 
If the rat jumps successfully to the white window, it might learn either 
that white is correct or that the right side is correct. Let us suppose that 
it learns right to be correct. The rat will then work on this hypothesis until 
it is either punished or extinguished. The rat might then adopt some other 
hypothesis, based upon a chance contingency of reinforcement and some 
aspect of the environment. This would continue until the rat happened on 
the correct hypothesis and kept at it long enough to convince the experi- 
menter that the rat had solved the problem. 

In other words, instead of gradually learning a single habit through the 
accumulation of the effects of reinforcement, the rat would have learned 
(and unlearned) a whole series of habits analogous to what human beings 
call hypotheses. 

This notion—that animals adopt hypotheses in solving problems—has 
been advanced by many investigators, most notably Krechevsky (1932) and 
Lashley (1938, 1942; Lashley and Wade, 1946). The noncontinuity theory 
has never been entirely expressed by one author, and many more points 
than the one about hypotheses have been made. Lashley, for example, in 
nearly all his publications emphasizes the momentary attentional set the 
animal may have in perceiving a portion of the stimulus field set in front 


Of it by the experimenter. 

The theorists who have sup 
tended to use experimental problems that arise out 
Own experience. The continuity theorists, on the oth 
to develop experimental hypotheses more rigorously from formal (and 
Sometimes mathematical) principles. In the light of this difference in ap- 
Proach it is not always possible to compare the continuity and nonconti- 
uity theories; but let us look at some of the experiments designed for this 


ported the noncontinuity viewpoint have 
of intuition and their 
er hand, have tended 


Purpose and see what the outcome has been. 


Comparison of the Continuity and Noncontinuity Theories 


The effects of changing the problem. est ( ont 
Comparative adequacy of the continuity and noncontinuity theories has 


Come from experiments in which a problem is altered part way through 


Some of the best evidence on the 
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the training program. This has most frequently taken the form of reversing 
the discriminative cues. Suppose that we start to train a rat to discriminate 
between black and white cards. We reinforce the black card and extinguish 
the white one. Then after a few trials, we reverse the cues, so that the white 
card is now associated with reinforcement and the black card is extin- 
guished. What effect will this have on the animal’s eventual rate of learning 
the problem? If the animal had been following an erroneous hypothesis (a 
position hypothesis, say) during the initial phase of the training, we should 
expect no effect if we reversed the cues, since the animal had not yet begun 
to associate the black and white cues with the reinforcement and its absence. 
Tf, however, each trial adds a small increment both of habit strength to the 
positive stimulus and of inhibition to the negative stimulus, as the conti- 
nuity theory would suggest, then reversing the cues ought to lead to nega- 
tive transfer or interference. 

Quite a few experiments which use the technique of reversing cues dur- 
ing the presolution period have been reported. Nearly all of these show 
that reversing the cues does retard learning (for example, McCulloch and 
Pratt, 1934; Ehrenfreund, 1948; Ritchie, Ebeling, and Roth, 1950; Grice, 
1951). If cues are reversed every few trials, animals eventually come to do 
better on successive reversals. Figure 19 shows that there is interference 
(reversals result in more errors) the first few times a pair of stimuli is 
reversed. After several reversals, however, animals come to do better than 
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Figure 19. The number of errors made on successive 

each problem increases rapidly at first, 
the initial problem. (Gatling, 1952.) 


Problem reversals. The number of errors on 
but after a few reversals, errors are less frequent than on 
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they would have done with no reversals at all (Gatling, 1952). This is much 
like the “Jlearning-set” phenomenon discussed below (also see North, 1950). 

It is clear, then, that the results of discriminative experiments in which 
the cues are reversed favor the continuity view. In addition to these ex- 
periments, there are some in which cues are modified rather than reversed 
during the presolution period and still others in which more than one kind 
Of cue is used. Let us look at some of these experiments. 

One that seems to favor the noncontinuity view is an experiment by 
Lashley (1942). Lashley first trained some rats to discriminate between 
large and small circles (the large circle was reinforced). Then Lashley 
continued the training, but this time he substituted a large triangle for the 
large circle. Thus the animals now discriminated between a large triangle 
and a small circle. Yet, these animals failed to make the distinction in shape, 
for when they were tested with the large triangle and the large circle they 
Showed no preference. Furthermore, they showed a preference for a large 
Circle over a small triangle. The results of this experiment suggest that the 
rats did not learn the incidental cue (shape), but only the cue they had 
been originally trained on (size). 

On the other hand, there are experiments which suggest that rats do 
learn such incidental cues. For example, Bitterman and Coate (1950) 
Showed that if a spatial cue is made incidental to the learning of a bright- 
atial cue is learned also. It is obvious that this 
ome experiments that we shall examine 
iling the findings that sometimes inci- 


Ness discrimination, the sp 
question is by no means settled. S 
later will suggest a way of recone 
dental cues are learned and sometimes they are not. 

Successive versus simultaneous discrimination. Another comparison of 


theoretical importance in discriminative training is that between simultane- 
Ously and successively presented cues. (It is not always clear which theory 
is on which side, however. Compare Grice, 1949, with Spence, 1952.) The 
Problem is especially important to the matter of relational versus absolute 
discrimination. If animals respond to relationships between stimuli in 
learning how to discriminate, then successive presentation of stimuli should 
Make the problem more difficult than simultaneous presentation. If, how- 
ever, animals respond simply to the positive reinforcement of one stimulus 
and to the extinction of the other, it should not make much difference, all 
other things being equal, whether the stimuli are presented together or not. 

One of the best experimental tests of this problem is by Grice (1949). 
It is difficult to compare the learning which occurs to simultaneously pre- 
sented stimuli and that which occurs to successively presented stimuli be- 
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cause in the latter there are no errors, since the animals have no choice to 
make. Grice, however, estimated the errors in successively presented stimuli 
by counting as errors all runs to the positive stimulus with latencies greater 
than the median latency and all runs to the negative stimulus with latencies 
less than the median. With this estimate, there was no difference in the 
learning curves for the two conditions. Thus, Grice concluded, it is not 
necessary for animals to compare stimuli in order to learn a discrimination. 

Spence (1952), however, argues from the absolute theory of discrimina- 
tion that under certain conditions simultaneous presentation should be 
easier for animals than successive presentation. He cites data from his 
laboratory and disputes evidence advanced by Weise and Bitterman (1951) 
showing simultaneous presentation to be more difficult. 

The problem is a complicated one. It is clear, as Nissen (1950) points 
out in a paper discussing earlier experimental work, that discrimination is 
to a large extent the establishment of approach tendencies to positive 
stimuli and of avoidance tendencies to negative stimuli. The experiments 
On successive versus simultaneous discrimination, however, show that the 
learning of even a moderately simple discriminative habit is too involved to 
be easily explained by any existing theory. The experiments by Bitterman 
and others are complicated, as Spence has remarked, by the existence of 
patterning between positive and negative stimuli; the extent to which this 


s A EEE) . . . s 1 jl 
influences ease of discrimination is unknown. One respect in which Grice’s 
experiment is different from the others is th 


lus patterning. Thus we should probably 
Strating that absolute tendencies to approach or avoid stimuli can be 


established, though relationships between stimuli may be fundamental to 
real-life animal discrimination. 


An evaluation of the two theories. 
experimental data on discriminative le 
tainly the absolute, continuity theory 
can be trained to have positive respo. 


paring these stimuli to others on the sa 
of training, 


at it is not complicated by stimu- 
accept his experiment as demon- 


It should be evident by now that 
arning have far outrun theory. Cer- 
is correct in asserting that animals 
ses to certain stimuli without com- 
me sensory continuum. In the process 
even though animals give no evidence of having grasped the 
discrimination, experience at the task has effects which can be detected 
by negative-transfer tests. On the other hand, there is also abundant evi- 
dence that animals do not simply react to any stimuli that come within 


range; they are selective in what they attend to. Furthermore there is good 


evidence that animals test preliminary “hypotheses” about the nature of the 
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Pproblem.* This, of course, is reflected in the experimental literature; this is 
Why there are no unequivocal data to support the extant theories. From the 
Vantage point we have at the moment, it appears as though an adequate 
theory will be a synthesis of many current notions. The idea of stimulus 
Seneralization, or something like it, is important and will probably be re- 
tained as an essential element in theory making. In addition there will 
Probably be notions describing more adequately the selective nature of 
attention and hypothesis in discriminative learning. 


THE DEVELOPMENT OF DISCRIMINATIVE CAPACITY 


An important problem that has received.much experimental attention is 
that of the development of the capacity for discrimination. This has been 
Studied in two ways. In the first place, there has been much experimental 
Work on the acquisition of distinctiveness of cues in discrimination, and 
Secondly, there has been an elaborate experimental program to study the 
gradual improvement in discrimination that comes from learning how to 
discriminate. Let us first consider the problem of acquiring distinctiveness 
Of cues. 

The acquired distinctiveness of cues. Much of the theoretical analysis 
Of discriminative learning has incorporated the assumption that cues are 
always learned to particular responses—that learning is a simple matter of 
Stimulus-response bonds. Nearly all theorists agree, however, that at best 
this is an oversimplification, and recently there have been many experiments 
designed to show that discriminative learning involves some mediated or 
Central associations. Some of the best of these experiments have been on 
the acquired distinctiveness of cues. 

Lawrence and his associates (Lawrence, 1949, 1950; Bauer and Law- 
rence, 1953) have established that when animals learn to discriminate 
between cues, they can transfer this discrimination to new situations in 
Which entirely different instrumental responses are required. Thus, the dis- 
tinction between cues is not correlated exclusively with the responses used 
g, and therefore discrimination learning cannot be 


In discrimination learnin ত ঠ 
bonds. Thus, in addition to consider- 


Simply a matter of stimulus-response 
comes from a phenomenon called “Vicarious Trial 


t that animals engage in vacillation and indecision 
Woodworth and Schlosberg (1954) have 


ny Much of the evidence for this 

ন Error. * This refers to the fac A 
Choice points in discriminative learning. 

Presented a good review of the effect. 
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ing the factors of positive and negative stimulus generalization in any oo 
plicated learning problem, we must also consider the extent to whic 
particular cues acquire distinctiveness for animals (see Bauer and Lawrence, 
se outcome of this work has been to point toward the solution 
of some of the issues in the continuity-noncontinuity controversy. Lawrence 
and his associates (Lawrence and Mason, 1955; Goodwin and Lawrence, 
1955) have suggested that in complicated discrimination problems, ani- 
mals can learn several habits simultaneously. First of all they can learn to 
“orient” toward a particular kind of stimulus (black versus white cards, 
etc.), and secondly they can learn the actual discrimination (white is Ie 
forcing). Suppose, however, there are other cues present, such as the height 
of a hurdle over which the animal must jump. If the experimenter changes 
the problem so that the height of the hurdle is the cue, the animal need of 
necessarily unlearn the black-white discrimination. All that is necessary is 
that the orientation towards brightness be extinguished. If this is so, the 
animal can learn the new problem without disturbing the old discrimination. 
Thus, in complicated discrimination situations it is possible to obtain be- 
havior in which the rat appears to engage in particular hypotheses without 
disturbing other hypotheses. Notice that this derivation of a complex effect 
in discrimination learning is built upon the simple mechanisms of reinforce- 
ment theory. Lawrence has devised many ingenious experiments which 
Point to the way in which problem solving and relational discrimination can 
be derived from simple mechanisms. The most important step is the aban- 
donment of the simple stimulus-response paradigm. Once we grant the 
Possibility of connections between stimuli and nonovert mediated processes 
in the animal, it becomes possible to derive complicated effects in discrimi- 
nation learning from simple premises. 

Reid (1953) has presented the s 
learning a discrimination animals a 
discriminative learning, 


ame notion, namely, that in addition to 
1so learn something about the nature of 
and this can account for the development of non- 
continuity behavior out of continuity-like Processes. Reid demonstrated the 
adequacy of this idea by showing that there Was more rapid learning of a 
discrimination reversal after a long period of overlearning on the original 
problem. Reid points out that Spence and others have suggested the need 
for a receptor-orienting act, or something like it. In simpler discriminative 
experiments receptor Orientation, or learning to notice particular cues, 1S 
unimportant because only a limited number of cues are presented to the 
animal. When conditions become more complicated, however, this orienting 


DISCRIMINATION AND LEARNING 93 


becomes a more important determinant of behavior. It is significant that the 
experimental evidence shows this orienting is learned in much the same 
Way that other kinds of discrimination are learned. 

Discrimination-learning sets. The examples of discrimination discussed 
thus far have been those in which animals obviously have had some diffi- 
culty in learning. Yet it is possible to find examples of discrimination in 
animals in which there is practically no learning, because the animal solves 
the problems immediately. In some experiments, for example, two stimuli 
are presented to monkeys; one of the stimuli, say, is a red cylinder and 
the other is a blue pyramid. Under one of these objects is hidden a peanut. 
When these stimuli are exposed, the monkey carefully examines each and 
Chooses to look under one. If, by chance, it looks under the wrong one, on 
the very next trial it looks under the other one. The discrimination is per- 
fect, and it is only a matter of chance whether the monkey discovers the 
Peanut on the first or on the second trial. 

Immediate and perfect solution of this and much more difficult problems 
Comes from a history of learning to make discriminations of this sort. As 
animals solve successive discriminative problems, they show an orderly 
And gradual improvement in this ability. Such an improvement in ability 
Harlow has called the formation of learning sets (Harlow, 1949). The 
demonstration of this gradual improvement is extremely important because 
it Suggests a direct continuity between the most primitive examples of dis- 
Criminative learning and the insightful behavior of sophisticated animals. 
One Kind of behavior arises out of the other. 

An analysis of the way in which this happens has just begun. One im- 
Portant point has been made (Riopelle, 1953)—transfer from problem to 
Problem based upon stimulus generalization of cues associated with success- 
ful solution becomes less and less as additional problems are solved. In 
Other words, animals learn that they cannot rely on stimulus generalization 
from one problem to the next to lead them to rapid solution of successive 
Problems. This means that stimulus generalization of the sort we discussed 
Carlier either becomes completely suppressed or radically altered in char- 
Acter, Warren (1954) points out that test-wise monkeys approach each 
tecombination of stimuli as a new problem however many times the specific 
Stimuli required in the discrimination had been differentially rewarded. 
Thus, even if one of the stimuli were much more frequently rewarded than 
the other in the past, the animal would not tend to choose this stimulus 
more often. Warren suggests that this means that the analysis of discrimina- 
tive learning according to the excitatory and inhibitory strengths built up 
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to particular stimuli as the result of reinforcement and extinction does not 
apply to animals that have learned many problems. 

Thus as a result of long experience at discrimination, animals learn some- 
thing new. They are no longer limited to the information that red objects 
Or square objects are associated with reinforcement; they learn that one 
of two stimulus objects will be reinforcing or that one that was not reinforc- 
ing on the previous trial will be reinforcing on the next trial. Such learning 
is truly the solution of a problem, and we shall reexamine discriminative 
learning sets when we consider the role of learning in problem solving. 


PERCEPTUAL LEARNING 


The influence of learning on perception. Almost as old as philosophical 
inquiry itself is the question, Is man’s ability to perceive objects of the 
world innate or is it the result of learning? In modern Psychology this ques- 
tion had long been dormant because nobody seemed to be able to suggest 
a reasonable or even testable solution to the question. In recent years, 
largely because it was raised in a new context (Hebb, 1949), there has been 
a revival of interest in the matter. Specifically, the question has been raised 
as to whether the normal visual and tactual perception of shape and pattern 
depends upon learning or whether it is an inherent property of the nervous 
system. 


In the first experiments aimed at ex 


Ploring this question, apes were 
reared in d 


{ arkness from birth to the time of testing. These apes, deprived 
Of all visual experience, were tested on vi 
results showed an astounding deficiency in visual form perception; these 
findings were not conclusive, however, because there was evidence of a 


deterioration in the physical capacity of the eyes because of the long period 
of disuse (Riesen, 1950). 


sual discrimination problems. The 


Several subsequent experiments have confirmed the earlier observations, 
and in these the Possibility of deterioration of the sense organs was fairly 
well ruled out. In one (Siegal, 1953), doves were reared with translucent 
head covers. These covers permitted light to come through (thus not de- 


priving the retina of stimulation) but did not allow definition of visual form. 
Birds reared with these covers took much 


pS Ee longer to learn simple form 
discriminations th 


an did normal birds. Tt seems certain that experience was 
a normal prerequisite for visual Perception in these birds, though there is 


some question as to whether a lack of Visual-motor coordination was not 
the basic factor. 
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In another experiment (Nissen, Chow, and Semmes, 1951) a chimpanzee 
Was reared deprived of tactual, kinesthetic, and manipulative experience. 
The results were much the same as those found in the experiments with 
the ring doves. This animal was grossly inferior to normal ones in the abil- 
ity to learn tactual discriminations. Again it is not clear whether the depriva- 
tion in experience resulted in a purely sensory deficiency or whether the 
deficiency was one in sensory-motor coordination. 

These experiments point to the great importance of experience in the 
Performance of even the most simple perceptual-motor acts. The deficiencies 
that result from deprivation are not the result of failure to learn how to 
discriminate generally, since the animal is free to perform discriminations 
in its other senses. The deficiency is specific to the sense deprived. If it turns 
Out, as Hebb believes, that the deficiency is a result of the failure of the 
nervous system to learn the nature of forms in the perceptual world, then 
the role of infantile learning is enormously more important than anyone 
has ever thought. There is strong incidental evidence to support Hebb’s 
Views, since it is difficult to understand how the perceptual constancies 


Could arise unless perceptual relations are learned. 


Response versus Place Learning 

Experiments on infantile sensory deprivation and perception suggest that 
learning of perceptual relationships can occur without the intervention of 
motor acts. Other sources supply good evidence for this. As we have seen, 
Some of the experiments on complex discriminative learning suggest that 
hl is necessary to assume that some discriminations are learned without be- 
Ing attached directly to motor responses. In the preceding chapter it was 
also pointed out that instrumental conditioned responses could be extin- 
Buished without the performance of the originally learned response. Another 
Problem arising from this question of pure perceptual learning concerns 
the nature of what animals, particularly rats, learn in mazes. A pure stimu- 
lus-response theory would say that rats learn chains of discriminations 
attached to chains of responses. Thus, at a particular signal, the rat learns 
to turn right. According to this view, rats would never really learn locations, 


Only responses. 


A considerable literature exists, h 
do learn more than simple chains of responses; they do indeed appear to 


learn locations. In one experiment (Tolman, Ritchie, and Kalish, 1946), 
Tats were trained to run a rather peculiar maze. After the rats had learned 
the pattern of the maze, new pathways were supplied and the old, direct 


however, purporting to show that rats 
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route to the goal was blocked. Most of the animals chose new pathways, 
which suggested they had learned the general spatial location of the goal. 
In another experiment, Galanter and Shaw (1954) showed that place learn- 
ing is demonstrated in rats if there are many environmental cues, while 
an earlier study (Thompson and Thompson, 1949) suggested that place 
learning is absent if spatial cues are minimized. 

It is clear from these and other studies that rats can learn “places” as 
well as chains of stimulus-response associations. It is probable that the 
highly organized perceptual learning implied in the learning of “place” 
relies on the gradual development of more primitive discriminative learning. 
It probably depends upon the growth of perception implied in the experi- 
ments on early sensory deprivation. It also probably hinges on the de- 
velopment of “learning sets” out of multiple experiences with simpler 
stimulus-response discriminative problems. Thus the learning of “place” 
would appear to be a higher-order example of perceptual learning. 

Tolman (1932, 1951) has long insisted upon the importance of perceptual 
learning in the behavior of animals and men. Despite Tolman’s belief, there 
has been relatively little analytical work directed toward finding out about 
the characteristics of this kind of learning. Perhaps an extension of the work 
On acquired distinctiveness of cues and on learning sets will help us to 
understand Perceptual learning more adequately. 


SOME CONCLUDING REMARKS 


Discriminative learning is an important and fascinating business, and 
there are many Problems we have not discussed. For one thing we have 
not said much about discriminative learning in adult human beings. Many 
of the experiments on animals have been duplicated at the human level. 
For example Loess and Duncan (1952) showed that successive presenta- 
tion of stimuli generated Poorer learning than simultaneous presentation, if 
the discrimination was difficult. Another investigator (Walk, 1952) showed 
ie reversal of a discriminative Problem resulted in a decrement in human 
Just as in animal subjects. Despite the fact that the outcomes of these 
experiments are similar to those conducted with animals, we suspect that 
discrimination in human beings is much modified by verbal and symbolic 
Processes. In a later chapter on problem solving we shall return to some of 
these questions concerning human discriminative learning. 

More important is the demonstration of gradients of stimulus generaliza- 
tion in voluntary human behavior. Stimulus generalization is basic to the 
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ability of organisms to transfer what they have learned from one situation to 
another. The relationships between stimulus generalization and even simple 
transfer are quite complicated, however, and we must postpone a discussion 
of these relationships until other important factors in human learning have 
been examined. Suffice it to say at present that stimulus generalization of 
Verbal responses to symbols and stimuli has been well established in human 
learning (Yum, 1931; Gibson, 1941) and in motor behavior (Brown, Bilo- 
deau, and Baron, 1951). These demonstrations are important, for they enable 
Us to make confident use of the notion of stimulus generalization in human 
learning. 

Unfortunately, there are many other questions about discriminative learn- 
ing that we shall have to ignore. One of the most interesting concerns the 
relation of capacity for discriminative learning to the natural behavior of 
animals and their mechanisms of survival. If this were a book about animal 
behavior rather than a book about the processes of learning, we should 
Study this question in detail. Let us, for now, take an example of the im- 


Portance of discrimination in natural animal life as an indication of the 


Tole of these factors in survival. 
There is a variety of caterpillar that survives because of the keen discrim- 


inative learning of its natural enemy, birds. This caterpillar, the larval stage 
Of the Cinnabar Moth, is brilliantly striped in black and yellow. Every young 
bird tries one of the caterpillars, and rejects it because, apparently, it is 
highly distasteful (Windecker, cited in Tinbergen, 1953). Thereafter the 
birds leave this caterpillar alone. Thus, the caterpillar sacrifices some of 


its kind in order to instruct birds in a discrimination highly useful to itself. 


Certain species of insects, moreover, mimic the appearance of distasteful 


insects, and therefore rely, to the extent they resemble the creatures they 
(Tinbergen, 1953). 

ea in natural history in which the notions 
derived from laboratory experiments on discriminative learning might well 
be Put to use. There is, indeed, a recent movement to put to use in wildlife 
Preservation and control some of the principles obtained from the laboratory 
Study of animal behavior (Smith and Geis, 1956). 


mimic, on stimulus generalization 
As you can see, there is a vast ar 


CHAPTER 5 


MOTIVATION AND LEARNING 


For a long time psychologists have made a distinction between the things 
we learn, our habits, and the things which prompt us to use these habits, 
our motives. Motives provide the internal impetus behind behavior (needs 
Or drives) and the direction the behavior takes (goals); learning provides 
the means by which the organism finds a Way to reduce or eliminate need 
by coming into contact with an appropriate goal. 

Motivation is a construct, not something directly observable in animal 
behavior. In other words it is an idea we invent to describe the impetus that 
makes animals active. Presumably it would be possible to invent a satisfac- 
tory theory of behavior that never mentioned the word motive or another 
word like it, but no such theory exists at the present time, and, indeed, 
Psychologists have used motivational constructs very freely. Nearly all 
Psychologists agree that it is necessary to make use of a construct that 
emphasizes the energy behind behavior. In this chapter we shall discuss the 
relationships between the motivational constructs and learning. In doing 50; 


we shall view motivation as the energizer of behavior and learning as its 
guide. 


MOTIVATION AND PERFORMANCE 


Motivation has at least two fundamental components, the need state and 
the goal, or external stimulus, that tends to eliminate or reduce that need. 


That is to say, for nearly every need there is some goal that will satisfy or 
reduce the need. Needs may be cyclic, or periodic 


case they usually depend upon changes in the intern 
of the organism. Hunger and thirst are such cyclic 
98 
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governed by internal changes, however. Some exist at a more or less con- 
stant level, awaiting only the occurrence of the appropriate external stimulus 
to set off the appropriate behavior. The avoidance of pain is such a need, 
and in all animals there are many other such needs, needs we ordinarily 
characterize as instincts. 

When a need is aroused, either by internal changes or an appropriate 
external stimulus, the result is usually that the animal performs some be- 
havior designed to reduce or eliminate the need. Thus, when an animal 
finds its leg in the jaws of a trap, it struggles to free itself; or when it is 
thirsty, it finds its way to a familiar watering place. A good many, though 
not all, of these patterns of behavior are learned. Furthermore, animals 
learn to anticipate these need states before they actually occur, and to act 
accordingly. Thus, animals learn to avoid traps, and they sometimes find 
their way to food or water when they are not hungry or thirsty, or at any 
rate they seem to learn to stay in the vicinity of water and food supplies. 

Activity and motivation. The classical theory of needs says that they 
arouse animals to activity. As a matter of fact, there are some well-estab- 
lished relationships between needs such as general hunger and activity. 
Activity increases directly as a function of the number of hours that rats 
have been deprived of food (Siegal and Steinberg, 1949). There is some 
Point, when the deprivation becomes severe enough, at which activity drops 
off (Skinner, 1938). Rats, when allowed to exist upon a free feeding sched- 
ule, will show regular cycles of activity and eating. A period of quiescence 
is followed by some activity which culminates in eating (Richter, 1927). 


Eating itself is followed by a period of subnormal activity (Finger, 1949). 


It has generally been assumed that all of these effects associated with the 


arousal of need for food are direct, unlearned effects of the physiological 
Consequences of deprivation. Recently, however, several investigators have 
Suggested that the increased activity and other things associated with hunger 
Are partly the result of learning. The cycle of quiescence-activity-eating- 
quiescence, for example, does not occur in rats when their environment is 
Carefully held constant (Baker, 1954). Furthermore, activity can be in- 
duced by sudden changes in environment (Campbell and Sheffield, 1953), 
and there is strong evidence that such activity is, in part, the result of a 
discrimination based upon the cues associated with eating (Sheffield and 
Campbell, 1954). Some of the ordinary activity-arousal effects of such 
leeds as hunger are partially the result of learning and experience. This is 
One demonstration of the fact that the behavior patterns aroused by needs 


Are modified by learning. 
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It is well known that other conditions of need, such as sex, EE 
lowered temperatures, have specific and general influences upon hl jl 
That is to say, there are many different organic conditions that ae | 
changes in the general activity level and that induce animals Eo per লৰ 
both unlearned and learned acts specific to each organic condition. 2 
of these specific effects are not of critical importance to the psychology 0 
learning, however, and we shall largely ignore them. 


200r- 


200oF- 


RESPONSES PER HOUR 


© 


Ls 
[e) 2 4 € 


GRAMS OF FOOD PREVIOUSLY EATEN 


just 
Figure 20. Rate of responding under Partial reinforcement as a function of amount of food i 


eis 
eaten. The lower curves are based Upon averages of data from four rats each. The upper curv 
An average of the lower curves. (Skinner, 1938.) 


Performance of learned acts and needs. 
tivated animals in almost any kind of learne 
mistakes and in general perform less adequat 
after rats have been thoroughly trained in a 
alley that leads to food only about 50 per cent of the time if they are not 
hungry (Strange, 1950). Similarly, the rate at which rats in a Skinner 2 
press the lever depends upon their state of hunger at the moment. Figure 2 
(Skinner, 1938) illustrates this point. It shows the rate of responding after 
the rat has eaten various amounts of food. The more food the rat eats be 
fore the experiment, the lower its rate of response. This general effect is also 


As we might expect, unmo- 
d task will make many more 
ely than motivated ones. Even 
T maze, they will choose the 
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true whether or not the rat is reinforced. Figure 21 shows the rate of 
responding during extinction. We can see that the larger the amount of 
food eaten by the animals before extinction, the lower the initial rate of re- 
sponding during extinction. 

This increase in the level of goal-oriented behavior with an increase in 
the basic need seems to be generally characteristic of needs. It is true of 
thirst as well as hunger, and probably other needs as well. Thus, it appears 
that a need state and a reinforcement are both necessary for the emission 
Of a learned instrumental response. We shall consider the implications of 
this for the theory of behavior shortly. 
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Figure 21. Examples of extinction curves for the lever-pressing habit in rats. The number shows 

how much food (in grams) was fed to each rat prior to extinction. Notice that the initial rate during 
extinction is much higher after no food than after 6 grams of food. (Skinner, 1938.) 


Before we consider other topics, it is well to settle one important question 
about the influence of need upon learned behavior. This concerns discrim- 
ination. We know that increasing the level of need increases the tendency 
to perform an appropriate learned act. But does it also increase the sharp- 
Ness of a learned discrimination? Suppose we have trained an animal to 
Perform some discriminative act in which reinforcement (food) is given 
for Pressing a lever in the presence of a light and not Pressing in the ab- 
Sence of the light. If the level of need is then increased by starving the ani- 
Mal for a period of time, we find that the rate of responding in the presence 
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of the light is greater and that responding in the absence of light ee 
greater. Thus it appears that while increased need raised the eR 
of responding it did not improve the discrimination; i.e., it did Dot cha 
the ratio of responding to the reinforced stimulus, over responding to the 

“ nonreinforced stimulus (Dinsmoor, 1952). Thus, heightened need does not 
appear to increase the sharpness of a discrimination, though it will increase 
the absolute tendency to perform the appropriate response. | 

Performance of learned acts and irrelevant needs. We have just SEER 
that increasing the level of need results in greater general activity and in a 
greater tendency to perform acts which have in the past satisfied ‘these 
needs. The question arises as to whether needs irrelevant to a particular 
pattern of behavior also tend to increase the tendency to perform that Ee- 
havior. Thus, if we have trained a rat to press a lever in order to ECE 
food, will making the rat thirsty increase the tendency to press the lever: 
Hull (1943), in his first important book on behavior theory, predicted, from 
his theory, that irrelevant needs would so affect behavior. The experimental 
literature does not clearly support Hull in this, however, and it will be 
worthwhile to see exactly how this is so. 

Some experiments show that after an animal has been trained under one 
kind of need, increasing the level of a different kind of need increases re- 
sistance to extinction (Webb, 1949; Brandauer, 1953). This has generally 
been interpreted to mean that the level of an irrelevant drive does affect 
learned acts specific to another drive. There is, however, a serious flaw in 
these experiments. The animals in these experiments were trained when 
they were hungry. Before extinction they were made thirsty by deprivation 
of water, but were given complete access to food. We know, however, haf 
rats will not eat much food when they are thirsty, and so, during extinction 
the animals in these experiments were probably slightly hungry as well as 
thirsty. This interaction between hunger and thirst has been very carefully 
studied, and we know that it can account for the effects in studies of ir- 
relevant needs such as those above (Verplanck and Hayes, 1953). 

In order to investigate this Problem adequ 


ately, it is necessary to use 
drives that do not interact upon one another. 


Strange (1954), taking ad- 
vantage of the fact that albino rats show a strong aversion to moderate 
light, trained a number of them to press a lever in order to turn off a light 
for a period of time. He then studied the effects of different levels of hunger 


and thirst upon the extinction of this light-aversion habit. The results clearly 


showed that changing the level of hunger or thirst had no effect at all at 
the resistance to extinction of the light-aversion habit. Thus, it looks 2 
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though when possible interactions between needs are controlled, there is 
little or no effect of irrelevant need upon habit. 

Other kinds of motives. We must not assume that all of the motives of 
animals are based upon conditions of organic need. As we shall see later, 
there is some indication of an important class of learned motives. Further- 
more, it now appears that there are many basic, unlearned motives that are 
not directly tied to some condition of organic need. After a long period in 
Which theorists tried to reduce the motives of animals to a small list of 
Organic need, several investigators simultaneously discovered, in different 
animals, that such things as visual exploration and manipulation of the en- 
Vironment are strong motives. 

For example, Montgomery (1951, 1953), has shown that visual explora- 
tion is a strong motive in the white rat, which accounts for many of the 
Peculiar features of maze behavior in rats. A series of studies on the rhesus 
monkey has clearly demonstrated that these animals are strongly motivated 
by Curiosity and the desire to manipulate interesting objects (Harlow, 
1950a; Butler, 1953; Harlow and McClearn, 1954). Not only are such 
motives more powerful and pervasive than anyone had thought, but it 
appears that they can be extremely resistant to the effects of satiation 
(Montgomery, 1954; Butler and Harlow, 1954). Thus they are probably 
Very effective in producing a variety of reinforcements in learning. While 
it is convenient to use food or water as reinforcement in learning experi- 
ments, it is not necessary. As a matter of fact, simple curiosity about stimuli 
is sufficient motive to provide intrinsic reinforcement in the learning and 
Performance of a visual discrimination in rats (Thompson and Solomon, 
1954), 

Motivation and learning jointly de 
that influence behavior are classified by contemporary psychologists into 
two classes, associative and nonassociative. Both factors are responsible for 
the performance of any particular instrumental act. If an animal performs a 
Particular act, it is highly probable that this is because (1) it has learned 
to do so and (2) it is motivated to do so. If motivation directed toward 
the Particular goal or incentive associated with the act is low, the probabil- 
ity of occurrence of the act will be low. Likewise if habit strength for this 
Particular act is low, the probability of occurrence of the act will be low, 
Cven though the appropriate motivation may be present. 

. Hull (1943) made this joint determination of response strength by mo- 
tivation and habit a keystone of his systematic account of behavior, and 
Many other theorists and experimental investigators have explored the im- 


termine performance. The factors 
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plications of the relationship. Incidentally, Hull formally specified the rela- 
tionship as a multiplicative one—the effects of motivation are multiplied by 
the effects of learning in order to achieve the appropriate level of strength 
of response. In a multiplicative relationship, if either factor is absent, the 
product is zero, so there will be no response strength if either motivation or 
habit strength is zero. We cannot take this too literally, for we do not know 
what is meant by “zero” motivation, or even by “zero” habit strength. 

Hull has shown how the joint determination of response strength by 
motivation and habit works by summarizing some data from other experi- 


NUMBER OF REACTIONS TO EXTINCTION (N) 


NUMBER OF REINFORCEMENTS (R) 


Figure 22. Resistance to extinction as a joint function of number of reinforcements and hours of 
deprivation. The actual obtained data are represented by the black points. The curves are derived 
from an equation based upon the theory of C. L. Hull. (Perin, 1942.) 


menters (Perin, 1942; Williams, 1938). These investigators studied the 
influence of number of reinforcements (amount of training) and number 
of hours of food deprivation (hunger) upon resistance to extinction of 4 
lever-pressing habit. The results Can be seen in Figure 22. This figure shows 
something of the relationship between habit and motivation, although it 
seems likely that the exact nature of the relationship would be modified for 
other habits and other motives. 

It is important to realize that this curve shows nothing of the way in 
which motivation might modify learning. It only shows how motivation an! 
habit jointly determine the strength (measured by resistance to extinction) 
of a particular habit. The effect of motivation upon the rate of learning it" 
self is a fundamental problem, however, and we shall deal with it next. 
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THE EFFECT OF MOTIVATION UPON LEARNING 


In Chapter 2, we saw that one of the major conclusions to come out of 
the study of reinforcing stimuli is that factors relating to reinforcement— 
Such as amount, quality, presence, or absence of reinforcement—do not 
intrinsically affect learning but, rather, performance. That is to say, rein- 
forcement seems to be critical only in getting an animal to perform some act. 
The latent-learning experiments illustrated that with a little ingenuity it is 
Possible to get animals to learn without specific reinforcements being asso- 
ciated with the act to be learned. Since motivation is simply a way of char- 
acterizing the internal state which leads an organism to respond positively 
to a particular reinforcement, it would seem likely that motivation would 
have an effect only on performance. Within limits this would seem to be 
true. Let us examine the experimental evidence on the question. 

Latent-learning experiments. In Chapter 2 we discussed one kind of 
latent-learning experiment in which hungry rats were allowed to explore a 
Maze before the introduction of a reward. In general, the results of several 
different experiments of this type showed, as was just mentioned, that learn- 
ing was independent of the kind of specific reinforcement employed. Rats 
allowed to explore a maze without food reward will, under the proper condi- 
tions, show that they have learned the maze as well as rats rewarded for 
every trial. 

The kind of latent-learning experiment important to the question of mo- 
tivation and learning is a little different. The animals, instead of being 
Motivated but not rewarded, are rewarded but unmotivated. For example, 
satiated rats may be allowed to explore a maze which contains a reward. 
If the reward is food, the animals are well fed before they are allowed in 
the maze. The result is that they do not seem to prefer to choose the path 
ending in reward; if they do find the food, they are disinterested. Since we 


Wish to know if the rats have learned the location of food despite their in- 


difference toward it, we make the rats hungry and then put them back in 


the maze. If they run the maze as well as animals receiving an equal number 
of reinforced trials when they are hungry, We should conclude that motiva- 
tion is not necessary to learning, but only to performance. : 
The experiments that have been reported are somewhat more compli- 
cated than the one outlined above. Usually they have been elaborated by 
the addition of a second need state, so that animals are permitted to explore 
2 maze, not just satiated with food, but thirsty. If water is present at some 
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point in the maze, animals readily learn to find their way to the water, but, 
of course, the critical problem is whether or not they will also learn the 
location of food. This was essentially the situation in a well-known experi- 
ment by Spence and Lippitt (1946). It turns out, incidentally, that Spence 
and Lippitt’'s rats did not apparently learn the location of food when they 
were thirsty and reinforced by water. As can be seen, however, the same 
criticism— based on the lack of independence of thirst and hunger— applies 
to this experiment as applied to the studies of performance and irrelevant 
need. 

Perhaps because these latent-learning experiments have been unduly 
complicated, the results of different experiments have been contradictory 
and confusing. Many of these experiments are critically reviewed in a paper 
by Thistlethwaite (1951), and it is unnecessary for us to discuss them in 
detail. Unfortunately, however, the basic issue has been lost in many of 
these studies. It has been pointed out (for example, Deese, 1950) that the 
fundamental issue is whether some aspect of the learning function (and 
this can be very exactly stated as a mathematical parameter) is or is not 
a function of the level of motivation. Adding several different drives and 
rewards to the latent-learning experiments merely complicates this issue. 
Because so many of the latent-learning studies are thus complicated, it will 
be well for us to turn to another kind of experiment in order to settle this 
issue. 

Experiments comparing effects of level 0f motivation. Tn the past few 
years, a large number of experiments have been designed to assess the influ- 
ence of motivation upon learning. Most of these experiments have used 
resistance to extinction as the measure of strength of response. The experi- 
ments are usually designed so that various groups of animals are trained 
to press a lever in the Skinner box under various levels of motivation and 
then all are extinguished under a common level of motivation. If different 
levels of motivation produced different effects on amount of learning, re- 
sistance to extinction in the various groups should be different, even though 
all animals are extinguished under the same motivation, because resistance 
to extinction should reflect the effects of conditions of learning. Nearly all 
of the experiments of this sort show negative results; different levels of 
drive during learning have little or no effect upon resistance to extinction 
(Kendler, 1945; Reynolds, 1949; Strassburger, 1950). One study of this 
type showed slightly different results (Finan, 1940). In this study some 
animals were trained after only one hour of food deprivation, and others 
were trained after 12, 24, and 48 hours. All animals were extinguished 
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after 24 hours. There was slightly less resistance to extinction in the groups 
trained after 1 and 48 hours of food deprivation than in the 12- and 24- 
hour groups. While the differences found in this experiment are small, they 
are interesting because they suggest that the stimuli arising from various lev- 
els of hunger show stimulus generalization in much the same way as exter- 
nal stimuli; since the animals were all trained under moderate hunger, the 
center of the generalization gradient was at moderate levels. 

One experiment (Carper, 1953) that has attempted to assess both the 
direct influence of motivation upon performance and learning in the Skinner 
box shows very clearly that resistance to extinction is very sensitive to 
motivating conditions prevailing during extinction but highly impervious to 
any differences in motivating conditions prevailing during training. 

Experiments on other kinds of instrumental responses have shown more 
complicated results. For one thing, it seems clear that there is a residual 
effect from high motivation if animals are switched to low motivation 
(O’Kelly and Heyer, 1948; Deese and Carpenter, 1951). That is to say, 
animals trained under high motivation and then switched to low motivation 
will perform better than those always tested under low motivation. As in 
the studies of resistance to extinction in the Skinner box, however, animals 
trained under low motivation and then switched to high motivation do as 
well as those tested under high motivation all along, provided the habit is 
a simple one (Deese and Carpenter, 1951). Surprisingly enough, even in 
So complicated a habit as that required in learning a 10-unit maze, animals 
trained under low motivation do as well as animals trained under high mo- 
tivation when tested under high motivation (Hillman, Hunter, and Kimble, 
1953). 

This is surprising because, as we have mentioned, the independence of 
learning from kind and quality of reinforcement held only for simple habits. 
As Seward (1951) has pointed out, a reinforcer does not strengthen a habit 
so much as it provides an incentive for its additional use. In complex habits 
the problem is complicated by the fact that a poorer quality of reinforce- 
ment provides incentive for only indifferent performance. Since animals 
learn only what they do or experience, if there is little or no incentive to be 
active, animals will not learn much. The same argument, of course, applies 
to need or motivation; if an animal is to learn, it must be motivated at least 
enough to be goaded into activity. 

This does not mean that motivation and reinforcement intrinsically affect 
the rate of learning, but merely that if these things are missing animals will 
not be induced to perform complicated acts. In the case of the experiment 
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on maze learning outlined above, curiosity probably helped the unmotivated 
rats to explore the maze even in the absence of food or hunger. This motiva- 
tion was weaker than hunger, consequently their performance in the maze 
was poorer than that of the motivated rats. It was, however, good enough 
to permit the unmotivated rats to learn the correct choices as well as did the 
motivated rats. 

Thus we may conclude that for relatively simple habits, learning under 
low motivation is about the same as learning under high motivation. Learn- 
ing of complex habits is modified by the influence of level of motivation on 
performance. Therefore it looks as though the associative mechanisms of 
learning are largely independent of the motivation-reinforcement factors 
that control behavior. These associative mechanisms are only incidentally 
brought to play in most learning situations because the combination of mo- 
tivation and the reinforcement specific to that motivation induces the or- 
ganism to act. Campbell and Kraeling (1954) point out that in many situa- 
tions animals may learn responses that are qualitatively different when they 
learn under different levels of drive. 


SECONDARY MOTIVATION AND LEARNED INCENTIVES 


For a long time many psychologists took the view that there were but a 
handful of unlearned motives, most of which, if not all, were biological 
needs. If one takes this view it is difficult to describe all the elaborate mo- 
tives of adult human behavior unless great emphasis is placed upon the 
development of learned motives. All of the complicated motives not directly 
and obviously connected with biological motives have been considered to 
be derived from biological needs by a process of learning. 

Although today most psychologists grant an important role to learned 
motives (or at least to learned incentives), the prevailing opinion seems to 
be that the view stated above is too simple. We are less sure today that all 
unlearned motives are exhausted by a simple catalogue of biological needs. 
As we examine the behavior of animals and men in nature, it is difficult to 
untangle motives that seem to be learned from those that are not. It is pos- 
sible that the actions of all motives, even those directly rooted in biological 
needs, are modified by learning and that there are some important motives 
that are simply the result of associative learning. Even though we are no 
longer convinced that all secondary motives are entirely learned, learning 
does play an important role in the elaboration of motives, and it will be 
worthwhile to examine some of the mechanisms of learning in motivation. 
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Secondary Drives 

We saw in the chapter on reinforcement that animals sometimes come to 
behave toward a hitherto neutral stimulus situation as if it had incentive 
value. Thus, if a chimpanzee, as the result of taking part in an experiment, 
comes to behave as if it were fascinated with poker chips, or if a rat comes 
to behave as if it were mortally afraid of a flashing light, very likely we shall 
say that the poker chips and the flashing light have acquired secondary re- 
inforcement value. Sometimes it has been assumed that the development of 
the incentive, or reinforcement, value for the neutral stimulus has been ac- 
companied by the development of an acquired drive. There are some diffi- 
culties with this notion, as we shall see later. For the moment, however, let 
us examine some of these experiments on acquired or token rewards. 

Token-reward studies. An interesting variation in the secondary-rein- 
forcement studies is found in the studies of token rewards. The general idea 
in such studies is to choose some primary drive, such as hunger, and some 
token, such as a poker chip, that can be used by an animal to “purchase” 
food. It is not too difficult to train chimpanzees to learn to use such tokens 
in food-vending machines as well as to discriminate among the various col- 
ored chips according to value (Wolfe, 1936). Furthermore, the chimpan- 
zZees can be taught new habits in order to obtain the chips. 

It is possible to get chimpanzees to accumulate such tokens for a while 
without being paid off in food (Cowles, 1937). Therefore it would appear 
that one could characterize the tendency of the apes to work for and ac- 
cumulate poker chips as a learned drive or motive. Several theorists, how- 
ever, have shown that this is a gratuitous notion (Brown, 1953; Seward, 
1953). Brown points out that the pressure to assume a learned drive for 
Poker chips would have been much greater if the apes had been either will- 
ing to work for the chips when they were not hungry or more willing to work 
for the chips when they could not be exchanged for food. Thus he implies 
that the chimpanzee’s token-reward habit is based upon secondary rein- 
forcement of a hunger motive, not upon a new drive for poker chips. 

There are few studies in the experimental literature that suggest that ani- 
mals do develop new drives by association with the reinforcement of an 
old one. The results of some studies are hard to account for without the 
assumption of learned drives; perhaps the best known of these experiments 
are those of Anderson (1941a; 1941b) on what he called “externalization 
Of drive.” In these, rats were required to learn a completely new maze solely 
on the basis of secondary reinforcement. These rats would learn the new 
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maze whether or not they were reinforced with food, or (and this is the 
important point) whether or not they were either deprived of food or sati- 
ated. The only requirement for learning to take place was that the new maze 
contain cues which had been associated with food in the past. Anderson 
characterized this kind of learning as “externalization of drive” because it 
showed that a motive had become established to an external Object rather 
than an internal state of need. 

Thus it seems possible that rats (and perhaps men) can acquire motives 
for previously indifferent goal objects. There are, however, very few ex- 
periments on this sort of thing, and therefore there are a lot of problems un- 
solved. What would happen, for example, if some new cues were associated 
with the reinforcement not of one drive, but of several drives? Would the 
resulting “learned” drive be more stable, less resistant to extinction, etc.? 
Would a partial reinforcement schedule during training make any difference 
in the strength of the learned drive? These are important questions, and 
the evidence we now have on secondary drive is not very adequate.! If we 
are able to make use of this concept in explaining more complicated prob- 
lems in human behavior, our extrapolations would make more sense and 
perhaps be easier to assess if we had better experimental evidence in lower 
animals. It is still possible, for example, that Anderson’s “externalization of 
drive” effect represents merely the arousal of a native curiosity drive. These 
are things that should be examined more thoroughly. 

Fear as a drive. We shall have to consider in detail in the next chapter 
the question of fear as a learned response. Ordinarily fear is considered to 
be an emotion, but many psychologists have pointed out that it can have 
motivating properties. It is responsible for variations in the strength of cer- 
tain instrumental acts, and reduction or elimination of it is apparently rein- 
forcing. Since fear is frequently learned (though not always so, contrary to 
an impression one would get from reading certain parts of the current psycho- 
logical literature), it is an important example of a learned drive. Unlike 
secondary or learned motives built upon hunger, thirst, or some other sim- 
ilar organic condition, learned fears are relatively easy to establish and 
examine in the laboratory. 

Miller (1948, 1951) presents us with the classical example of fear as a 
learned drive. A rat is introduced into an apparatus like that illustrated in 
Figure 23. There are two compartments in the apparatus, a white one with 
an electrically wired grid as the floor and a black one with a smooth solid 


1 Wike and McNamera (1955) have made a start 


at answering these questions. The 
results of their experiments show that the 


answers obtained will not be simple. 
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floor. There is a door between the two compartments; the rat can open this 
door either by pressing a lever to the side of the white compartment or by 
turning a wheel at the end of the white compartment. First of all the rat is 
Shocked in the white compartment and allowed to escape into the black 
compartment. This teaches the rat that the white compartment is to be 
feared and that the black compartment is safe. In order to test for a learned 
drive, however, it is necessary to find out whether the rat will learn a new 


Figure 23. Apparatus for demonstration of fear as a learned drive. One compartment is white 


and one is black. The door between the compartments can be opened either by pressing a lever or 
by turning a wheel. The rat learns to escape one or the other compartment by the association of 


electric shock-produced pain and the stimuli from that compartment. (Miller, 1948.) 


Tesponse to escape from the white compartment without further electric 
shocks. Miller showed that rats could learn to turn the wheel, with escape 
into the black compartment as reinforcement, much 8 Ey Sher instru- 
mental response is learned. Figure 24 shows that each time the rat is placed 
in the white compartment it takes less time for the rat to get around to 
turning the wheel. 

Thus it seems clear that acquired fear can serve as a4 Motivation for in- 
Strumental activity and that reduction or elimination of this fear can serve 
as reinforcement. The importance of acquired fear in emotion and conflict 


is dealt with in the next chapter. 
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Figure 24. Learning to turn the wheel with escape from the shock compartment as a motive. The 
rat is shocked in the white compartment, from which it can escape by an open door. On subsequent 
nonshock trials, the rat must learn to turn the wheel, which opens the door, in order to escape from 
the compartment in which it had been shocked. (Miller, 1948.) 


Theories of Secondary Drives 


Acquired fear, as outlined in the experiment presented above, is fairly 
easy to produce. Indeed, it is sufficiently simple and reliable that it may be 
used in a laboratory or demonstration in elementary psychology. Learned 
drives based upon reward are very difficult to demonstrate, however, and, 
indeed, Myers and Miller (1954) present strong evidence for the existence 
of artifacts in the experiments purporting to demonstrate them. They sug- 
gest that these experiments (like Anderson’s, mentioned above) are proba- 
bly influenced by exploratory drives or Curiosity. 

Thus there seems to be a real difference in the ease with which secondary 
appetitive drives and secondary aversive drives are produced. In fact, there 
is not yet an unequivocal demonstration of the development of a secondary 
appetitive drive in the literature. Several reasons have been advanced for 
this difference, and it has become a matter of importance in theories of be- 
havior. 

Miller (1951) argues that it may have to do with the relative abruptness 
of onset of the primary drive. Aversive drives can be readily aroused by 
noxious stimuli (such as electric shock), so it is relatively easy to pair a 
particular stimulus condition with the arousal (and reduction) of the drive. 
Appetitive drives develop slowly; it takes time to become hungry. There- 
fore hunger, as a drive, cannot be neatly paired with a particular stimulus 
condition. Furthermore its reduction (by eating) is Probably a more grad- 
ual and complicated affair. 
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Seward (1953) advances an argument also based on a distinction be- 
tween primary drives that have sudden and those that have gradual onsets. 
He suggests that drives with sudden onsets may be conditioned directly to 
neutral stimuli (and thus give rise to fear in the case of pain as a primary 
drive). Drives with gradual onsets, however, are learned because of the ob- 
Struction or blocking of a learned response. 

This brings us to the views of Mowrer (1953). He states that any stim- 
ulus that happens to occur with some increase in drive strength produces a 
capacity in the stimulus for eliciting a token (learned) drive. This token 
drive is emotional in character and akin to fear (in the case of hunger, pre- 
sumably it would be fear of hunger, if it were possible to pair an increase 
in hunger with some simple stimulus). A token-drive decrement is reinforc- 
ing, and thus whenever a response brings about reduction of fear, this re- 
sponse is increased in strength. 

This particular notion leads to the view that fear, or at least some kind of 
general avoidance, is characteristic of all secondary drives. Thus, it would 
seem that if the less obviously biological motivation of human life is to be 
explained as secondary drive, we are driven principally by fear or at least 
by something to be avoided. This, of course, is an unwarranted extrapola- 
tion, and to say the least, does not do justice to Mowrer’s views. It does, 
however, suggest the more temperate and perhaps important view that much 
of human motivation could be conceived of as learned fear or anxiety. 
Perhaps one trouble with this view is the implication, which is hard to es- 
cape, that fear or anxiety is necessarily unpleasant or to be avoided. Yet 
itis a commonplace that human beings will deliberately place themselves in 
fear- or anxiety-evoking situations (such as dangerous speeding or riding a 
roller coaster), and it seems gratuitous to assume that people do this simply 
because it feels so good when they stop. It is very possible that fear or anx- 
iety has attractive as well as negative qualities. 

These matters we have been discussing are important to the theory of 
human motivation, and no doubt the next few years will see intensive activ- 
ity in trying to fit them to problems in human behavior. At present, however, 
they stand as hypotheses derived from animal experiments, Which have the 
Potential of providing us with some reliable explanations of human behav- 


ior. We cannot force them to do more at present. 


MOTIVE AND INCENTIVE CONDITIONS IN HUMAN LEARNING 


Discussion of the nature of human motivation has probably covered 
more paper than any other psychological topic. We must treat this vast liter- 
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ature rather cursorily, for the simple reason that not very much of it is 
relevant to the question of the relationship between motivation and learning. 
Some of it is, however, and a few introductory words about theories of the 
origin of human motives are necessary in order to set the stage for the em- 
pirical and experimental studies we shall discuss in detail. 


The Origin of Human Motives 


A good deal of the theorizing about human motives has concerned the 
problem of their origins. An older view was that human motivation sprang 
from a number of instincts (the number varying with each theorist) com- 
mon to all human beings and exhibited by them in varying degrees. While 
the multiple instinct view gives some signs of recovering from the extreme 
neglect it has suffered for the past thirty years, it is still not an influentiat 
notion. Today, the predominant view is that human motivation stems from 
either a small number of basic urges or even one basic urge, and that all of 
our concern for family, money, prestige, social status, security, etc., comes 
from an elaboration of this basic urge or urges through experience and 
maturation. This seems to be the fundamental viewpoint of psychoanalytic 
theorists, and of those theorists who would see human motivation built up 
through a process of secondary reinforcement and of secondary-drive 
learning. 

Both the psychoanalytic viewpoint and the secondary-drive viewpoint 
seem to agree that human motives go through a developmental process. 
There is a great difference between these theories, of course, in emphasis, 
method, and implications, but their major difference as far as we are con- 
cerned here is that the psychoanalytic view would put greater stress on 
maturation and the secondary-drive view, on learning. 

Both of these theories, however, seem to agree that goal relationships 
have to be learned. In other words, most of the rewards and punishments 
in human life are not innately rewarding or punishing, but develop these 
characteristics through learning. Thus the child’s motives change and elabo- 
rate as it comes into various relationships with its parents and siblings. 

Most psychologists are convinced that the notions of human motivation 
given to us by psychoanalytic or secondary-drive theory are too simple, 
but unfortunately there is no other systematic theory that approaches these 
in completeness and inventiveness. Thus, while we agree that they are prob- 
ably partially right—human motivations do change through experience and 
maturation—we reserve the opinion that not all of the complex motives of 
human life can be demonstrably derived by either of these theories. 
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Experimental Study of Human Motivation 


Experimental approach. Motivation is an invented concept used to de- 
scribe certain important things about animal and human behavior; it is not 
a fact of the world or an experimental variable that can be directly con- 
trolled and manipulated. We associate this concept of motivation with a 
number of variables that we can manipulate directly, however, and these 
include such things as depriving animals of food for periods of time or 
changing their bodily balance by the introduction of drugs, etc. 

In the study of human motivation we are much more limited in the fac- 
tors associated with motivation that we can manipulate directly. There are a 
few experimental studies in which the influence of food deprivation upon 
some aspect of human behavior has been studied (see Lazarus, et al., 1953), 
but these studies are few and frequently not basic to our problems. In the 
older literature the most frequent technique in the study of human motiva- 
tion has been to take advantage of the covariation assumed between motive 
state and incentive value—the assumption that most people are “more mo- 
tivated” for $10 than $1. Thus, almost all of the experimental work on hu- 
man behavior cited in P. T. Young's well-known book, The Motivation of 
Behavior (1935), is on the effect of incentives upon human behavior. It is, 
however, desirable to use other techniques, if for no other reason than to 
study the interaction between goal or incentive value and states of the in- 
dividual induced by other conditions of his life. 

It is extremely difficult to produce conditions in the experimental labora- 
tory that allow us to vary systematically the factors associated with human 
motivation. For this reason, in recent years, there has been an increased 
emphasis in the study of human motivation upon what has been called re- 
Sponse-response relationships. This means that we use some behavior of 
the individual to tell us how he is motivated, and then use this behavior to 
predict something further about an individual— perhaps how he will per- 
form on a test of learning. Thus one of the main techniques in the study of 
human motivation has been the correlational approach. There are certain 
inherent limitations in this approach, the principal of which is the inability 
to eliminate extraneous variables. Thus, there is considerable argument 
over the merits of the response-response technique, or correlational ap- 
proach, to the study of human motivation. 

Studies of incentive value. There is a large literature in industrial psy- 
chology on the relationship between various incentives and output. Most 
Of this is not particularly relevant to learning, but two examples will demon- 
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strate a point. Kitson (1922), in a classical study, showed that industrial 
workers of long experience may not necessarily be working at top efficiency. 
Despite the fact that the typesetters whom he studied had ample opportunity 
to learn their trade to perfection, the addition of a monetary incentive re- 
sulted in a great improvement in output. Similar studies (Mace, 1935) have 
Shown that the addition of an incentive may delay or eliminate work decre- 
ment due to continuous work. 

Various incentives used in industrial work create many problems of a 
psychological and sociological sort. While these are fundamental to an un- 
derstanding of human behavior in industrial society, they are not directly 
important to the psychology of learning. Consequently, the techniques of 
incentive payment in industry will not be stressed here. Suffice it to say 
that the use of incentives such as piece-rate payments or special bonuses 
does not always work as we think it should. The motivations of workers in 
modern industrial society are very complicated. It is true that the human 
being in our civilization seems to be almost universally motivated for the 
acquisition of coin, but this is frequently less important than other motives. 
He may be motivated to earn a certain, minimum amount of money with 
little effort. He may be motivated by such things as security, social approval, 
feelings of self-respect, etc. Thus, the manipulation of money as an incentive 
may not always lead to the expected results. 

In the school, grades, verbal rewards, and reproofs have been intensively 
studied. These factors probably operate less upon learning itself than upon 
performance, but there is little reason to doubt that they are powerful tech- 
niques in the control of performance level. The effects of specific verbal 
rewards and punishments are sometimes difficult to evaluate, however, be- 
cause there are enormous individual differences in the effectiveness of these 
incentives (differences probably associated with individual differences in 
basic patterns of motivation). 

Motivation assessed by a self-report inventory. Let us turn to the tech- 
nique of assessing motivation by means of the behavior of the individual. 
The most commonly employed technique here is the use of self-report in- 
ventories. These consist of a series of items such as “I am sometimes trou- 
bled over what others think of me” which the individual answers by true or 
false alternatives. In the experimental study of motivation the largest num- 
ber of investigators have made use of a self-report inventory known as the 
Manifest Anxiety Scale (Taylor, 1956, etc.). The theory behind this inven- 
tory is that the items in it tap an emotional responsiveness that is related 
to drive or motivational level. Those individuals scoring high on the inven- 
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tory are said to be anxious, and their anxiety state is assumed to act as a 
generalized motivator to behavior. Because there is some emotional basis 
to this kind of anxiety drive, it has been questioned whether the Manifest 
Anxiety Scale primarily reflects properties ordinarily associated with emo- 
tion or with motivation (Deese, Lazarus, and Keenan, 1953; Eriksen, 1954). 
Consequently we shall deal with many of these same problems in the chap- 
ter on emotion and learning. For the present, however, let us see how 
far a consideration of manifest anxiety as an indicator of motivation 
gets us. 

At first blush the results of experimental studies are very favorable to 
the interpretation of this inventory as a measure of anxiety drive. There 
seems to be little doubt that classical conditioned responses (either the 
galvanic skin response, or a skeletal reflex such as the eyeblink) occur more 
readily in highly anxious individuals than in individuals reporting little anx- 
iety (Taylor, 1951; Spence and Taylor, 1951; Bitterman and Holtzman, 
1952; etc.). This is what one would expect if high drive had a generalized 
effect of increasing tendency to respond. 

Beyond these studies of classical conditioning the results become less 
clear, however. For example, in a study of the frequency of eyeblinks (not 
conditioning, but simply a study of frequency of occurrence), there was no 
tendency for individuals high on the anxiety inventory to blink more fre- 
quently, even when under conditions of tension (Meyer, Babrick, and 
Fitts, 1953). Curiously enough, however, there was a tendency for fre- 
quency of blinks to correlate with another inventory of maladjustment. 

When we turn to the study of more! complicated, instrumental acts, the 
Picture becomes even more confused. Here, very frequently high anxiety is 
associated with poor performance (Taylor and Spence, 1952; Farber and 
Spence, 1953). High drive leading to poor performance would, at first 
glance, seem to be contrary to the usual effects of drive, but this is not 
necessarily so. For if anxiety is a truly generalized drive it will activate in- 
correct as well as correct responses. If, as is frequently the case in complex 
incorrect responses are initially stronger than correct re- 
sponses, it is argued that the effect of increasing drive would be to increase 
the difference between incorrect and correct responses. Therefore, one 


might reasonably expect, so the argument goes, that high anxiety would be 


associated with poor performance in some situations, at least initially. Fur- 


thermore, one experiment (Montague, 1953) shows that if the task to be 
learned gives ample opportunity for interference, subjects scoring high on 
the anxiety inventory do more poorly; but if the task gives little opportunity 


human learning, 
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for interfering effects of incorrect on correct responses, subjects high in 
anxiety do better. 

Now this seems all very well, except that a direct test of the influence of 
drive state upon the interfering tendencies of incorrect responses does not 
seem to bear the analysis out. Buchwald and Yamaguchi (1955) compared 
the effects of habit reversal in the maze learning of rats under high and low . 
thirst. The results clearly showed that reversal learning was more rapid 
under high drive, despite the fact that reversals in maze learning provide 
just about the optimal conditions for interference. 

The fact is that a hypothesis about the effect of drive upon interfering 
tendencies has never been clearly stated. Taylor and Spence (1952; Tay- 
lor, 1956) say that because drive multiplies habit strength to determine 
response strength, increasing the drive ought therefore to increase rela- 
tively strong habits at the expense of weak habits. Thus, if incorrect habits 
are strong they ought to be increased more than weak habits. This assump- 
tion, however, is not at all clear from the mathematics. To be sure, mul- 
tiplying two different numbers by a constant (equivalent, according to 
Taylor and Spence, to the effect of high drive on strong and weak habits) in- 
creases the absolute difference between them, but the ratio of the two habits - 
remains the same. Some theorists have assumed that any choice an organ- 
ism makes between two responses is based upon the ratio of their strengths 
(or their probability of occurrence, if the two responses exhaust the possi- 
bilities), and in this case increasing drive level would not differentially in- 
crease the strength of the stronger habit. 

Thus, while the idea of using the Manifest Anxiety Scale as an index of 
motivation is an interesting one, the implications have been somewhat dis- 
appointing. In the next chapter we shall examine some of the implications 
of using this as well as other, similar scales as indices of emotional reactivity 
to stress. As we shall see, it is probable that our current concepts of emotion 
and motivation are closely allied to each other and that any really serious 
attempt to decide whether the Manifest Anxiety Scale reflects more of one 
than the other will await clarification of the concepts themselves. 

Another kind of anxiety inventory has been used by other investigators 
(Mandler and Sarason, 1952; Sarason, Mandler, and Craighill, 1952). This 
inventory was designed to assess the individual’s reaction to situations in 
which his abilities are under test. Does, for example, an individual display 
signs of anxiety and worry when he is taking an important examination? 
The authors of this inventory believe that it taps both motivation for 
achievement and emotional responsiveness to conditions of stress. Thus in- 
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dividuals scoring high on this inventory are not only higher in motivation, 
but also, because of past learning, are likely to exhibit signs of emotional 
“disturbance which might interfere with performance. 
Generally speaking, individuals who score on the high end of this “test 
anxiety” inventory do more poorly in tests of learning and performance. 
+ This is particularly true in the case of tests known to be sensitive to emo- 
tional interference. In some cases, however, these individuals do better than 
those scoring low; they increase their scores more with practice. This is 
Probably due both to the fact that these individuals adapt to their states of 
, emotionality and to the fact that they are probably more highly motivated. 

Thus, the studies of individual differences in the things tapped by these 
personality inventories suggest that in human life, there is a close relation- 
Ship between many kinds of motivation and emotion. These are so inter- 
twined with one another that it is probably impossible simply to say that 
higher motivation automatically means better performance. This also de- 
pends upon the kind and level of emotional reactivity. 

Projective tests of motivation. Another important technique in the as- 
sessment of individual differences in human motivation lies in projective 

« tests. The most ambitious of these has been an attempt to measure need 
for achievement by means of the Thematic Apperception Test (McClelland, 
et al., 1953). In this test, an individual's need is evaluated by means of the 
stories he makes up to fit pictures of individuals in various situations. In 
general, great reliance is placed upon the extent to which an individual 
Shows achievement imagery. The authors of this version of the Thematic 
Apperception Test were able to bring impressive evidence to show that 
they were actually measuring individual differences in motivation. 

What about the need achievement-learning and performance relation- 
Ship? One experiment (Lowell, 1952) rather clearly showed that there was 
a difference in performance during the learning of an anagrams word test. 
Those individuals high in need achievement did much better. A test on an- 
other task, in which subjects were not expected to show learning or im- 
Provement, suggested that the difference in performance associated with 
need achievement is correlated with performance level and not with rate 
Of learning. This is, of course, much what we should have predicted from 
studies of animal behavior. This relationship between need achievement and 
learning and performance may eventually be of practical importance, since 
there is evidence that need achievement scores are correlated with college 
grades, even when individual differences in intelligence are taken into con- 


sideration (McClelland, et al., 1953). 
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Thus, although the use of individual differences as a technique for in- 
vestigating motivation is relatively new, we have encouraging evidence that 
we shall learn something by it. The use of personality inventory scores has 
proved to be a little disappointing, largely because they are heavily saturated 
with emotional factors. There are many possibilities, however, for the de- 
velopment of inventories relatively free from emotional components (per- 
haps something like the traditional study habits inventory will work with 
college students), and this matter is not yet closed. The need-achievement 
scoring of the Thematic Apperception Test gives great promise, and re- 
cently a similar projective test has been used, with encouraging results, in 
the study of achievement motivation (French, 1955). We do not have much 
data on the relationship between these tests and the question of whether 
learning or performance is affected by motivation, but the evidence we do 
have suggests that it is largely performance rather than learning that is 
affected by different levels of motivation in human beings. Thus, any dif- 
ferences that do develop in learning are probably secondarily due to the 
basic differences in performance level. 


Incidental Learning 


We find another important technique in the study of motivation in hu- 
man learning in experiments on incidental learning. In these experiments 
the performance of individuals who practice at a task with deliberate intent 
to learn is compared with that of individuals who practice without this in- 
tent. Let us look at a simple example of an experiment in incidental learning. 

An experiment in incidental learning. One example of incidental learn- 
ing is provided in a study by Jenkins (1933). He had students, who 
thought they were serving as experimenters, read lists of nonsense words to 
other students, who were instructed to learn the words. After the learners 
had reached a certain criterion of performance, both the subjects and the 
experimenters were dismissed and told to return 24 hours later. At this 
time both subjects and experimenters were asked to recall the lists of words. 
It turned out that both the learners and the experimenters recalled many of 
the words; the learners, however, recalled more words than did the experi- 
menters. Upon asking for reports from the subjects, however, Jenkins found 
that some of the experimenters did deliberately try to learn, but even those 
who did not, recalled many words. 

This experiment shows that (1) even without specific instructions to 
learn, human beings will form a set to learn; (2) even without this specific 
set to learn, human beings will learn. In this connection, we may note, as 
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the set to learn decreased in Jenkins’s subjects, the amount of material re- 
tained 24 hours later also decreased. Thus, while it appears that subjects 
can learn without a specific set to learn, they seem to learn more adequately 
When they have such a set. 

Analyses of incidental learning. Incidental learning is a complicated 
effect, and there have been many experiments performed in an effort to 
untangle the basic causes of the differences between incidental and instructed 
Subjects. Right now, of course, we are interested in the question principally 
from the point of view of motivation. Is the lesser motivational set induced 
in the incidental learners responsible for their poorer performance? 

In order to answer this question we must look more closely at the experi- 
ments. Saltzman (1953), for example, asked whether or not the difference 
between incidental and intentional learning might be due to the difference 
in the orienting task required of subjects working under two different con- 
ditions. Translating Saltzman’s question into the context of the experiment 
Outlined above, we can see that even the task of being an “experimenter” 
might interfere with the incidental learning of words. Saltzman was able to 
Show that if the orienting task were allowed to interfere with the perform- 
ance of intentional learners, they actually did worse than incidental learners. 

In subsequent experiments (Neimark and Saltzman, 1953; Saltzman, 
1956) the problem was further explored; it turns out that if intentional 
learning is superior to incidental learning, this is so only under certain con- 
ditions (such as certain distributions of trials through time) and with cer- 
tain orienting tasks. Some orienting tasks given to incidental learners inter- 
fere with learning and others facilitate learning as much as does intent to 
learn. 

Further analyses of problems in incidental learning have been made by 
Postman and his associates. Like Saltzman, they (Postman, Adams, and 
Phillips, 1955) assume that the difference between incidental and inten- 
tional learning depends upon the kinds of responses people make to the 
material presented to them. Specifically, they thought, intent to learn would 
make it more likely that human learners would notice critical differences 
between various verbal items they were required to learn. The experimental 
results showed that when items to be learned were easy to discriminate, 
there was not much difference between incidental and intentional learning; 
but in those cases in which the items to be learned were unfamiliar and low 
in association value, there was a large difference. 

Other experiments in this same series (Postman and Adams, 1956; Post- 
man, Adams, and Bohm, 1956) further emphasize the role of intent to learn 
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in determining the kind of responses made by the learners and whether or 
not these will be appropriate to the task of learning. Thus it is fairly clear 
that the comparison between incidental and intentional learning is not sim- 
ple, since the different effects these two conditions have upon performance 
will depend upon the nature of the task to be learned and the learner’s ori- 
entation to the task. 

Incidental learning and performance versus learning. What do the ex- 
periments on incidental learning tell us about the comparative effects of 
motivation upon learning and upon performance? It is clear from these 
experiments that what is learned depends upon the given set for learning. 
To the extent that a quantitative index of learning depends upon what is 
learned, the amount learned will depend upon the set for learning. Thus it 
is not surprising that an attempt to find an analogy in human learning to the 
latent learning found in maze learning of rats was not too successful (Post- 
man and Tuma, 1954). The troublesome thing about this experiment seems 
to be that it is difficult or impossible to devise conditions in which subjects 
perform the correct responses for learning the task. It is probably unfair to 
push the analogy to maze learning in rats too far, but it is much as if an 
unmotivated rat refused to run through the maze and thus could hardly 
learn the maze, latently or otherwise. 

Therefore, it is clear that unmotivated people sometimes do not learn 
very well, not because of some intrinsic connection between motivation and 
rate of learning but because they do not perform the responses actually to 
be learned. Some years ago there came to the attention of the author a 
youngster who had completely failed to learn the mechanics of subtraction 
in school. He was a bright boy, but evidently during the school hours de- 
voted to subtraction he had hit upon the expedient of writing down numbers 
more or less at random in order to satisfy the superficial requirements of 
the classroom. Thus, this unmotivated youngster failed to learn subtraction 
for the simple reason that he would not £0 through the activity necessary to 
the learning of subtraction. In a practical way, motivation may be impor- 


tant to learning simply because it is important to the performance of the 
right responses. 


CHAPTER 6 


EMOTION AND LEARNING 


It is apparent to everyone that learning plays an important part in emo- 
tional and affective behavior. Most of our likes and dislikes are learned. 
We all know that many, if not most, fears are acquired. We know about the 
importance of experience in the development of personality disturbances. 
In this chapter we shall not consider all of these things, but rather, look at 
Some of the basic principles we must use when we apply the psychology of 
learning to problems in emotion. 

In applying the psychology of learning to problems in emotion, some new 
and important issues arise, and it is these issues to which we shall largely 
direct our attention. The problem of emotional learning does not involve 
merely the same principles we have discussed already; it involves new prin- 
ciples, as well as new ways of considering those we have already looked at. 
Before looking at the application of the psychology of learning to emotion, 
however, one basic point about the concept of emotion needs to be men- 
tioned. 

Even the most superficial examination of the biological and psychological 
characteristics of emotion shows that it is related to the concept of motiva- 
tion. Motivation, as it is most often used by psychologists, emphasizes the 
activating role of central mechanisms in behavior. It has been apparent for 
a long time, however, that the factors associated with the activation of be- 
havior are also associated with feelings of pleasantness and unpleasantness, 
as well as with expressive behavior in rage or fear (Lindsley, 1951). Recent 
anatomical and behavioral work has shown the intimate interaction between 
the central neural mechanisms associated with motivation and those asso- 


ciated with emotion (see Stellar, 1954). 
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It is certainly apparent that some kinds of emotional activity accompany 
powerful motivation. We saw in the last chapter, for example, that a learned 
fear can motivate rats to learn a new instrumental act. The fear is clearly 
an emotional state, but it is also a motivational state. It turns out that a 
large portion of the experimental literature on the relationship between 
emotion and learning is devoted to an analysis of the motivating properties 
of such powerful, unpleasant emotions as fear. 

Before we consider the motivating aspects of emotion, however, let us 
see how emotions themselves are learned at the most elementary level. In 
order to do this, we shall examine some studies of emotional learning in 
animals and very young children. 


CONDITIONING AND EMOTIONS 


An important point about the psychology of emotions is that it has al- 
ways been associated with the activity of the autonomic nervous system— 
that part of the nervous system controlling the actions of the heart, glands, 
and smooth muscles. The activity of the autonomic nervous system is not 
voluntary, and the elements of emotional expression autonomically con- 
trolled are automatic and reflex-like in character. Thus, acceleration of the 
heartbeat, sweating, blushing, etc., are all things that occur more or less 
automatically, given the presence of some emotion-producing condition. 

This involuntary characteristic of elements of emotional expression is 
important, because it suggests that the conditioning of these responses is 
classical rather than instrumental. In the first chapter, we saw that in classi- 
cal conditioning responses were elicited directly by stimuli. An uncondi- 
tioned stimulus elicits a response and this response can be transferred to a 
hitherto ineffective stimulus simply by pairing that stimulus with the uncon- 
ditioned stimulus. In instrumental conditioning, the response usually occurs 
spontaneously—there is no stimulus that seems to elicit it—and this re- 
sponse is learned (occurs more frequently) when it is followed by some re- 
inforcement or reward. Thus, since emotional activity seems to be elicited 
by some more or less well-defined stimulus, it can be conditioned by the 
classical conditioning technique. 

Because of the reflex-like characteristics of emotional behavior, the first 
applications of Pavlov’s technique in American laboratories were to the 
conditioning of emotional behavior. Let us look at some of these early at- 
tempts to apply the classical conditioning technique to learning emotional 
behavior. 
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Classical Conditioning of Emotions 


The Watson and Raynor study. One of the best known of all these 
studies is by Watson and Raynor (1920). These investigators were inter- 
ested in the development of fears in infants, and they made a long series of 
Observations on a number of infants. One baby boy, who was about one 
year old at the time, was the subject of their conditioning study. Previously, 
they had discovered that a number of stimuli which one might ordinarily 
think would arouse fear were not fear producing for this youngster. White 
rats, dogs, masks, burning newspapers—none of these produced fear. One 
stimulus that did produce a startle reaction followed by crying and other 
Signs of emotional activity was a loud sound. Consequently, they decided 
to employ the loud sound as an unconditioned stimulus for emotional 
activity. 

In the actual conditioning procedure they paired the sight of a white rat 
(previously not fear producing) with the occurrence of the sound. A very 
few pairings were sufficient to elicit a full-scale emotional response simply 
by presenting the white rat alone. Interestingly enough, this conditioned 
fear generalized to other animals and to other furry objects, such as a wad 
Of cotton. 

This was a rather clear and dramatic demonstration of the possibilities 
Of conditioned emotions, as well as a demonstration of the importance of 
Such principles as stimulus generalization in the extension of the conditioned 
reaction. Watson and Raynor thought that this experiment could serve as a 
model for the way in which the complex and irrational fears of daily life 
might arise, and they placed much emphasis upon the importance of con- 
ditioned emotional reactions in the child’s personality development. 

In the intervening years, however, there have been enough studies of the 
developmental processes in young children so that we know Watson and 
Raynor's conclusions were a bit overdrawn. We now know that the specific 
stimuli for fear and other emotional reactions depend to a considerable ex- 
tent upon maturation and the particular developmental stage of the child. 
In part, the high specificity Watson and Raynor EEGLDNLEG to the emotional 
reactivity of the child was due to his age and maturation level. 

Other studies of emotional conditioning in children. One important 
Point from the work of Watson and Raynor stands up well, however. We 
know from subsequent investigations that emotional reactions can be con- 
ditioned by the classical technique. Jones (1931), for example, was able 
to show much the same pattern of conditioning in a fifteen-month infant by 
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pairing an electric shock and a buzzer. Jones studied the extinction of this 
conditioned response and noticed that overt reactions to the conditioned 
stimulus extinguished much more rapidly than signs of autonomic activity. 
Jones also established the existence of such phenomena characteristic of 
classical conditioning as external inhibition and spontaneous recovery. 

In another study (Jones, 1930), the same investigator conditioned the 
galvanic skin reaction in children by pairing a mild electric shock with a 
buzzer. The children in this study ranged in age from three to nine months, 
so that the investigator was able to establish the fact of conditioned emo- 
tional reactions early in infancy. Jones concluded that his studies rather 
clearly established the applicability of the classical Pavlovian conditioning 
technique to the modification of emotion in children. 

The importance of emotional conditioning. It turns out that these sim- 
ple examples of conditioned emotional reactions are extremely important in 
the theory of learning, particularly as the theory applies to situations in 
which emotional disturbances have occurred. The essential point of these 
studies is that emotional activity is conditioned by the classical technique. 
Since the emotional reaction is aroused originally by an unconditioned stim- 
ulus, there is no need to control it through reward. It is behavior that is, by 
and large, beyond voluntary control. Few of us can directly elicit emotional 
activity in ourselves in the same sense that we can open doors and recite the 
pledge to the flag. A few people can “voluntarily” cry, etc., but there is a 
strong belief that this is an indirect kind of control. That is to say, it may be 
possible to produce voluntary crying by thinking about something likely to 
arouse emotional activity. The crying is indirectly aroused by the presence 
of some internal (perhaps conditioned) stimulus. It is, incidentally, pos- 
sible to make use of “lie detection” techniques simply because emotional 
reactions usually beyond voluntary control can be aroused by external 
stimuli. 

Thus, the conditioning of emotional reactions seems to be largely a proc- 
ess of association. Organisms learn to become emotional in new situations 
simply because these situations occur when they are emotional for other 
reasons. At the beginning of this chapter, however, we pointed out that 
emotions seem to have some of the properties of motivating states such as 
hunger or thirst. Where, then, does this aspect of emotional reactivity fit in? 
Clearly not in the transference of emotional reactions themselves from an 
unconditioned to a conditioned stimulus, since this merely involves the as- 
sociation of the response elicited by both these stimuli. If the emotional re- 
action is itself motivating, then reduction of it ought to serve as reinforce- 
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ment for instrumental activity. This gives rise to some very important ques- 
tions, which we can best examine by looking at the problem of avoidance 


conditioning. 


Avoidance Conditioning and the Two-factor Theory 


Since experiments in avoidance conditioning have been at the root of 
most notions that have tried to describe the role of the motivating proper- 
ties of emotions in learning, we shall do well to begin our discussion by 
looking at this kind of conditioning. 

An example of avoidance conditioning. Avoidance conditioning causes 
much difficulty because it looks very much like classical conditioning, al- 
though, as we shall see, the resemblance is misleading. A simple example 
will illustrate this point. 

A dog is trained to stand perfectly still in an experimental apparatus 
Which permits the careful recording of the movements of all of its legs. One 
Of its rear legs rests upon an electrode that completes a circuit through the 
dog, so that it can be shocked through this leg. If the dog lifts its paw, of 
course, the shock is terminated. In the conditioning procedure, a buzzer 
Sounds about two seconds before a brief shock is applied to the dog’s leg. 
Soon after the conditioning procedure has begun, there are obvious signs 
of emotional disturbance elicited by the buzzer, and this makes it look very 
much like a simple example of classical conditioning. After a bit, however, 
the dog begins to lift its paw when the buzzer comes on. If the paw is kept 
up until the buzzer stops, the dog will not get shocked. After quite a few 
training trials, the dog lifts its paw to every sound of the buzzer, and thus 
never gets shocked. Therefore, it is quite clear that the paw-lifting response 
is instrumental; it prevents the dog from being shocked. 

This experiment starts out as an example of classical conditioning, but it 
clearly ends up with the characteristics of instrumental conditioning, for the 
response permits the dog to avoid the electric shock. 

This experiment could have been turned into an example of a “pure” 
classical conditioned response by strapping the electrodes to the dog’s leg 
S0 that it could not escape or avoid the shock by flexing its leg. Would this 
have made any difference? Certainly it would have influenced the frequency 


With which the leg response was part of a conditioned response. A rather 


Well-known experiment with guinea pigs illustrates this point. In this experi- 
ment (Brogden, Lipman, and Culler, 1938) a buzzer was sounded a num- 
ber of times, each time just before the guinea pigs received an electric 
shock. These animals were shocked in a revolving cage, and half of them 
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avoided the shock if they began to run when the buzzer was sounded. The 
remaining animals were shocked whether or not they ran. The results of 
this experiment can be seen in Figure 25. As this figure clearly shows, the 
animals that escaped or avoided the shock learned to run very quickly. 
Those conditioned exclusively by the classical technique, however, did not 
increase their frequency of running to the sound of the buzzer very much. 
‘This does not mean that they did not give “emotional” conditioned re- 
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Figure 25. A comparison of classical and instrumental conditioning of an operant. The solid line 
represents the frequency of conditioned responses to a buzzer when the animals (guinea pigs) are 
shocked for not running. The broken line represents frequency of conditioned responses when the 


buzzer is always followed by shock, whether or not the animals respond. (From Brogden, Lipman, 
and Culler, 1938.) 


sponses on these trials, it means merely that the instrumental act of running, 
since it was not reinforced, did not dramatically increase in frequency. 
Probably the increase in running that does occur under the classical con- 
ditioning technique is the result of the high level of activity that is part of 
the emotional conditioned response. 

The paradox of avoidance conditioning. Fora long time theorists wor- 
ried about an apparent paradox in avoidance conditioning. When an animal 


1 There is no implication in this experiment that classical conditioning is necessarily 
poorer than instrumental conditioning of the same response. See Kimble, Mann, and 
Dutfort (1955). 
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escapes a noxious unconditioned stimulus such as shock by responding, it is 
rewarding. But, why should avoidance of the noxious stimulus be reward- 
ing? How can a stimulus not experienced by the animal be said to be a 
Source of satisfaction or drive reduction? 

Common sense would say, of course, that the animal comes to anticipate 
the shock and that is why it responds to the buzzer. The learning theorist is 
not likely to be satisfied with this answer, however, since he will want to 
know how and why the anticipation develops. A number of theorists have 
attempted to answer this question, and one of the most widely accepted so- 
lutions is by Mowrer. Let us look at his notion about how avoidance con- 
ditioning develops. 

Mowrer’s two-factor theory of avoidance conditioning. Since Mowrer 
(1947) argued that avoidance conditioning is a two-stage process, his 
theory is known as a two-factor theory. He says that first of all the animal 
learns to give a classical conditioned emotional response to the conditioned 
stimulus. This, of course, is a matter of pure involuntary association; it 
does not depend on the animal's voluntarily selecting a response because it 
is drive-reducing. As we saw in the last chapter, however, emotional dis- 
turbance or fear produced by a neutral stimulus is itself a drive. If the ani- 
mal performs an instrumental response that reduces the emotional disturb- 
ance, this is a reinforcing state of affairs and in subsequent situations the 
animal will again choose this response. Thus, if flexing the leg to escape 
Shock also serves to reduce slightly the emotional responsiveness to the 
tone, it will be a response preferred by the animal. 

In a nutshell then, Mowrer’s theory says that the animal avoids the shock 
not just in order to avoid it, but to escape from the emotional state now con- 
ditioned to the buzzer. Actually, Mowrer was able to support his argument 
With some important evidence. What would happen, for example, if we 
altered the avoidance situation slightly? Suppose, instead of the buzzer al- 
Ways being sounded until the shock came on, the buzzer was terminated if 
the animal anticipated the shock by giving an instrumental response. Mow- 
rer and Lamoreaux (1942) showed that, if this were the case, learning of 
the avoidance response would be more rapid. Thus, if it is possible for the 
animal to escape the conditioned stimulus, learning to avoid the uncondi- 
tioned stimulus is easier. 

Because the buzzer is contiguous with the shock, to summarize the mat- 


ter, it comes to elicit emotional reactions that are unpleasant to the animal. 


Some response terminates the buzzer, and the emotional disburbance is 


Partially relieved. This relief constitutes need reduction, which is reinforcing 
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for the instrumental response. So really, one might say, the animal responds 
because the response terminates the unpleasant emotional activity con- 
ditioned to the buzzer. Such behavior has the appearance of anticipatory 
behavior—and Mowrer believes that he has solved in part the problem of 
anticipatory behavior—which has always been a stumbling block to learning 
theory. 

We need not necessarily accept the distinction between contiguity con- 
ditioning of emotional (primarily autonomic) responses and need-reduc- 
tion conditioning of instrumental responses, as does Mowrer. The need- 
reducing properties of the reinforcement for the instrumental response can 
be thought of simply as the instigator. Mowrer’s theory does necessarily 
imply, however, that the conditioned emotional response must occur before 
the animal can learn to avoid the unconditioned stimulus by making an in- 
strumental response. 

More about the two-factor theory. The two-factor theory has had enor- 
mous implications for the theory of behavior. It has suggested, to some bold 
theorists, that many of the mechanisms of neurotic behavior are learned in- 
strumental responses used by the individual to reduce in part some learned 
fear or anxiety. Thus it is important that we examine this notion in some 
detail. 

Solomon and Wynne (1954) have extended the two-factor theory to 
take into account some experimental data they believe to be important. 
One of these is the problem of extinction of the emotional reaction. If an 
avoidance habit is well established, the animal no longer receives painful 
stimulation, therefore the emotional reaction to the conditioned stimulus 
ought to extinguish and thus indirectly weaken the instrumental response. 
Solomon points out, however, that things do not always happen this way. 
Sometimes avoidance responses are extraordinarily resistant to the effects 
of extinction, and furthermore latencies of the conditioned avoidance re- 
sponse keep getting smaller even after the animal has received its last shock 
(Solomon, Kamin, and Wynne, 1953). One explanation for this, suggest 
Solomon and Wynne, lies in the fact that the animals perform the instru- 
mental response extremely quickly when the conditioned stimulus appears; 
so quickly, as a matter of fact, that there is no time for the conditioned 
stimulus to elicit the full-blown emotional response. Thus, they argue, the 
emotional response is conserved by extremely rapid instrumental respond- 
ing. Since this responding does not allow anxiety to develop, it is not fully 
reinforced, and the animal may lose interest in responding rapidly. Every 
so often, therefore, the reaction time for the instrumental response is rela- 
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tively long. It is protracted enough to elicit a full-blown emotional response, 
and when this happens the animal does perform the instrumental response 
and is reinforced. In addition, Solomon and Wynne argue that intense emo- 
tional conditioned responses are irreversible (not subject to extinction). This 
is an interesting suggestion, and we shall return to it later. 

Some of the further experimental implications of the two-factor theory 
are being intensively explored. Additional work by Mowrer and his associ- 
ates (Mowrer and Solomon, 1954; Mowrer and Aiken, 1954) has estab- 
lished the importance of temporal relationships between conditioned and 
unconditioned stimuli in the establishment of fear, and the freedom of such 
a response from any drive-reducing mechanism. Some difficulties with the 
two-factor theory do crop up, however. For example, Dykman, Gantt, and 
Whitehorn (1956) find that a component of the conditioned emotional 
response, conditioned heart rate changes, varies rather nicely with the in- 
tensity of the painful unconditioned stimulus, but the differentiation of this 
emotional response does not precede the differentiation of an instrumental 
avoidance response to the conditioned stimulus. Thus, the differences of 
instrumental responding to different intensities of painful stimulation do 
not seem to be dependent upon prior differences in conditioned emotional 
States to these intensities as perhaps they should be according to the theory. 
We may expect to see a good deal of experimental work on this and related 
Problems in the two-factor theory in the next few years. 


THE ROLE OF EMOTION IN CONFLICT 


There is a widely held notion that conflict between the anxiety-producing 
aspects of certain goals and their positive attractiveness is responsible for 
certain disturbances in behavior. Stated more generally, conflict is regarded 
as important in behavioral disorders. Consequently there have been rather 
intensive laboratory studies of the nature of conflict in an effort to establish 
4 relationship between the theory of conflict and Hore general theories of 
learning. In this section our aim will be to see how this experimental work 


has managed to determine a relationship. 


The Basic Principles of Conflict 

Lewin (1931, 1935) was responsible for the basic analysis of conflict 
behavior. His analysis, of course, was within the framework of general HO 
tions about fields of psychological forces responsible for the psychological 
State of the organism. Lewin has made a distinction between three types 
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of conflict. These are (1) approach-avoidance conflict, (2) approach- 
approach conflict, and (3) avoidance-avoidance conflict. Approach-avoid- 
ance conflict occurs when the same stimulus object is both desirable and 
unpleasant. An example of such conflict could be seen by shocking thirsty 
rats for drinking water. In approach-approach conflict, the organism is 
faced with the necessity of making a choice between two equally desirable 
goal objects—I can spend my savings for a new car or for a vacation. Fi- 
nally, in avoidance-avoidance conflict, the organism is faced with stimuli 
that are only to be avoided—I can go to the dentist and suffer in the chair, 
or I can avoid the dentist and endure a toothache. 

N. E. Miller (1944) has examined the implications of these definitions 
by making certain hypotheses about the relative strengths of approach and 
avoidance gradients. Miller points out that there are four principles funda- 
mental to the analysis of conflict behavior. These are: 


1. The tendency to approach a goal is stronger the nearer the organism is 
to the goal. 

2. The tendency to avoid an unpleasant stimulus is stronger the nearer 
the subject is to the stimulus. 

3. The tendency to avoid increases more rapidly with nearness to the un- 
pleasant stimulus than does the tendency to approach with nearness to a 
goal. 

4. These gradients of approach and avoidance are dependent upon the 
strength of the particular drive. With stronger drives, the tendencies to ap- 
proach goals or avoid unpleasant stimuli are much stronger. 


Brown (1948) was able to show that these principles do apply to the be- 
havior of rats. He trained a group of rats to run down a straight alley to food 
associated with the presence of a light. The animals wore a light harness 
which could be attached to a calibrated spring, so that if the animals were 
momentarily restrained, the strength of pull could be measured. In the ex- 
periments, Brown measured the strength of pull at two different points in 
the alley, at one point near the goal and at one point far from the goal. 
For a study of the avoidance gradient, the animals were shocked in the 
goal box and then the strength of the tendency to avoid the goal box was 
measured at two different points in the alley. 

The results of Brown’s studies can be seen in Figure 26. The avoidance 
gradient is steeper than the approach gradient, and increasing the drive, either 
by making the animals hungrier or by increasing the Shock, raises the level 
of both the approach and avoidance gradients. 
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Approach-avoidance conflict. If the same goal were made both attrac- 
tive and unpleasant, one would expect from the results of Brown’s experi- 
ments that the animals would approach part way and then stop and vacillate 
because the avoidance and approach gradients had crossed. Where this 
happened they would be about equal in strength, and thus the animals 
would be subject to an almost equal pull in opposite directions. This is 
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Figure 26. The solid line is an approach gradient. The dashed lines are avoidance gradients of 
different strengths. Conflict is strongest where the avoidance gradients cross the approach gradients. 
Note that the strong avoidance gradient crosses the approach gradient further from the goal than 
does the weak avoidance gradient. (From Miller, 1944; data from J. S. Brown.) 


actually what happened in an experiment (Miller, Brown, and Lipofsky, 
1943). Animals ran to a goal box in order to get fed, and they were shocked 
While they ate. Thus, both an approach gradient and an avoidance gradient 
were produced. The results of the experiment showed that if the shock were 
very strong, the animals would run only a little way down the alley and then 
hesitate; if the shock were weak, they would run almost the full length of 
the alley before they stopped. Thus, the principles tested by Brown are 


Verified in this experiment. a 
Avoidance-avoidance conflict. The avoidance-avoidance conflict situa- 
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tion is one of the most interesting of the cases. In this kind of conflict vacilla- 
tion is maintained the longest. Avoidance-avoidance conflict makes the organ- 
ism choose between two undesirable alternatives. If it moves away from one 
Of the alternatives toward the second, the avoidance tendency to the second 
becomes greater and the organism will tend to move back toward the first 
again. This kind of situation Miller calls a stable equilibrium. Two experi- 
ments (Klebenoff, 1939; Hunt, 1943) demonstrate clearly the tendency 
toward vacillation as well as a tendency to try to leave the field. 

Approach-approach conflict. Miller points out that approach-approach 
conflict will produce little vacillation—it is essentially an unstable equi- 
librium. Once the conflict is resolved (perhaps by a purely chance choice), 
the organism tends to approach one of the goals with increasing strength, 
since, of course, as it gets closer to that goal the attractive pull of that goal 
becomes greater and the attractive pull of the alternative goal becomes 
weaker. 

Not all experimental results agree in showing that approach-approach 
conflict works this way, however. Godbeer (1940) reports considerable 
vacillation in approach-approach conflict in children. Williams (1943) says 
that this is because the child can anticipate symbolically the frustration of 
losing one of the goals, something a rat cannot do. Williams points out that 
many supposed cases of pure approach-approach conflict also involve an 
element of avoidance. It is not to be expected that approach-approach 
situations will produce no vacillation. 

Miller recognizes this, and he suggests that approach-approach situations 
sometimes involve another kind of conflict, a conflict he names “double 
approach-avoidance.” This occurs because as one goal is approached the 
other becomes more distant and the Organism wishes to avoid giving up 
this goal. The net result is that the approach tendency may become rela- 
tively weak as the organism approaches the goal. The distant goal seems 
more attractive, and the organism may turn back to it. 


Further Applications of the Principles of Conflict 


The problem of displacement. One application of the principles of con- 
flict has been to displacement. Displacement occurs when an act of aggres- 
sion cannot safely be directed toward its instigator, but instead gets diverted 
to some innocent object. Thus the man who is angry at his boss inhibits 
expressing aggression to the boss and instead, expresses it to his secretary. 
Displacement would occur only if we assume that the response of inhibiting 
the aggression (avoidance) shows a much steeper curve of stimulus general- 
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ization than the displaced (generalized) aggression response itself. Miller 
and his associates (Miller and Kraeling, 1952; Murray and Miller, 1952) 
were able to establish that this is the case for a simple maze habit in rats. 
Thus, there is considerable justification for extending the analysis of conflict 
to the problem of displaced reactions. 

Conflict and anxiety. In conflict that involves avoidance tendencies, 
there is nearly always some level of conditioned emotional behavior present. 
This is because, as we saw earlier, the avoidance tendency itself depends 
upon the conditioned emotional reaction. Usually avoidance means the emo- 
tion of fear or of anxiety, so in any conflict involving avoidance one of 
these is present. In addition there has been a widespread belief that the con- 
flict per se is the source of additional fear or anxiety. We shall examine this 
notion when we come to look at the problem of experimental neuroses. It 
serves to point out, however, the great importance of the analysis of the 
conditions of conflict. 

The spatial analogy in the analysis of conflict. Most of the experiments 
On conflict we have discussed are experiments in which actual spatial dis- 
tance from the goal was an experimental variable. The space, in these cases, 
is physical. However, from Lewin has come the notion that conflict can be 
conceived of in the framework of forces within a psychological space. This 
notion of psychological space is implicit in much of our thinking about con- 
flict. For example, when we speak of avoiding the dentist versus avoiding 
the toothaches as an example of conflict, we implicitly conceptualize this 
Situation in an imaginary psychological space. We see the individual as a 
Point in this space surrounded by barriers and buffeted by two opposing 
forces. The analogy to experiments on rats in physical space would have 
much greater force if we could deal with this imaginary psychological space 
in the same way that we deal with physical space. This is SS of the great 
unsolved problems in the analysis of conflict, and the fruitfulness of this 
approach to more general cases of psychological conflict will depend upon 
the degree of success we have in solving this problem. 


THE PROBLEM OF PUNISHMENT 


Behavior is punished when some painful stimulation or the threat of such 
Stimulation is made contingent upon it. Thus, if I punish my dog by cuffing 
it for chewing up the newspapers, in effect I have made the cufiing con- 
tingent upon chewing newspapers. Usually punishment is administered to 
Moderately well-motivated behavior in order to stop the behavior. Thus 


136 THE PSYCHOLOGY OF LEARNING 


punishment is a technique for one organism controlling the behavior of 
another, though, of course, some particular behavior of an animal or human 
being may be inadvertently punished by a physical accident. In this case a 
pseudocorrelation between the behavior and its punishing consequences is 
set into the organism. 

The essential property of punishment, then, is that it is painful or un- 
pleasant stimulation applied to moderately well-motivated or highly moti- 
vated behavior. The rationale behind the social use of punishment is that it 
will stop undesirable behavior, though our modern insights into personality 
mechanisms lead us to suspect that punishment is sometimes administered 
to satisfy some need of those who inflict it. As a problem in the theory of 
learning, however, punishment is best seen as the influence of painful or un- 
pleasant emotion-producing stimulation upon behavior. 


The Classical Experiments on Punishment 


For many years experimental and theoretical work on punishment was 
dominated by the views of the late E. L. Thorndike. He was, for most of 
his professional life, an educational psychologist, and consequently his con- 
cern with the problem of punishment was influenced by the formal demands 
of the classroom. In a word, he was concerned with the influence of verbal 
praise and reproof upon behavior. Since his views and experiments were 
influential, let us look briefly at them. 

Thorndike’s views and experiments. In his earliest writings, Thorndike 
simply adopted the common-sense view that punishment would reduce the 
tendency to repeat behavior it immediately followed (weakened connec- 
tions, as Thorndike put it). Later, however, Thorndike (1932a) examined 
the problem more thoroughly and came to another conclusion. He decided 
that punishment really had no inherently weakening effect on behavior. 
Let us see how Thorndike arrived at this rather surprising conclusion. 

Thorndike performed a number of experiments on punishment, with 
both animal and human subjects. As his experiments on human subjects are 
better known, let us review one of these. Thorndike (1932b) asked subjects 
who were not familiar with Spanish to guess the correct English equivalent to 
a Spanish word. He gave a series of such words in the form of a multiple- 
choice test. If the subject chose the correct word, the experimenter in- 
formed the subject by saying “right”; if the word was incorrect, the ex- 
perimenter said “wrong.” It was possible to find the influence of “ight” 
or “wrong” by finding out, on further testing, if the subjects gave the pun- 
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ished or rewarded responses with a frequency greater or lesser than that 
expected by chance. Since there were five alternatives for each item, the 
frequency of repetition by chance would be 20 per cent. This was the base 
line against which Thorndike measured the effect of saying “right” or 
“wrong.” As it turned out, “right” did increase the tendency to repeat the 
initially given response. The consequence “wrong”, however, rather than 
being weakening, seemed to Thorndike to have a slight strengthening effect. 

Criticisms of Thorndike. Many similar experiments by Thorndike 
(1935) led him to the same conclusion. Other investigators leveled serious 
criticisms at him, however. First of all, they argued, the use of “chance” as 
a base line against which to measure the effects of reward or punishment 
is not good; it is extremely unlikely that responses given by subjects in such 
tests are at the chance level; rather responses £0 in “runs,” or sequences, 
to which the one-in-five chance expectancy would not apply. Stephens 
(1934), for example, used an empirically determined probability of repeti- 
tion by including items to which the experimenter gave no reward or pun- 
ishment. The frequency of repetition without any consequences turned out 
to be 36 per cent instead of the 20 per cent assumed by Thorndike. Meas- 
ured from this base, punishment did have a slight weakening effect (given 
the mildness of the punishment and the low motivation for repetition we 
should not expect a large effect). 

Punishment indirectly weakens. Thorndike argued, however, that 
Wherever punishment appeared to weaken a response it WAS an indirect ef- 
fect. This happens because punishment produces variability of behavior. 
In maze learning, for example, punishment of AerTor may lead the learner 
accidentally to make the correct response, and this response indirectly elimi- 
nates the punished response, because the correct response 1s strengthened 
due to its effect. It has never been precisely clear what Thorndike meant 
by the indirect weakening of punished responses (see the controversy, 
Postman, 1947; Stone, 1948) but it would be something like this: Punish- 
ment may or may not weaken a response, but it clearly cannot be the mirror 
image of the action of a reward. If a response is rewarded, it is apparent to 
the organism in a typical experimental test that LEBEATON of this response 
Will again be rewarding, but if a response is punished, it is not clear to the 
Organism which of the other available responses will be rewarded. Thus, 
behavior after punishment will be more variable, e.g., the Bredicron of the 
exact response to be made by the organism will be more difficult. This is 
the most general statement we can make. In the Thorndikian situations the 
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punishment is a mild one, and perhaps its main function is to provide in- 
formation. If this is so, reproof for making a particular response does not 
supply as much information as praise, for it does not tell the subject which 
response is correct. 


Punishment in Experiments on Animal Behavior 


Many of the important conditions of punishment are ignored in the 
Thorndikian experiments. The magnitude and quality of unpleasant effects 
are ignored, and, perhaps more importantly, the effects of restricting the 
available responses are ignored. This last point needs some amplification. 

In Thorndike’s experiments the learner may satisfy his motivations by 
making responses other than the ones punished. He is seldom interested in 
defending his original response as the “right” one, and he will readily aban- 
don it when he knows it is “wrong.” Thus, the effect of punishment is to 
produce variability of behavior, as Thorndike suggested. 

What happens, however, when the only response which satisfies a drive 
is punished? The child who is punished for taking cookies from the cup- 
board knows no other way to satisfy a craving for sweets. No matter how 
many raps on the knuckles the child may receive, the cookies lose none of 
their attraction, and he is not likely to find another way of getting cookies. 
There are interesting experiments upon the behavior of rats that bear di- 
rectly on this problem. 

Skinner’s experiment. Skinner (1938) studied the effect of punishment 
upon the lever-pressing response in the Skinner box. In his experiment, the 
punishment was a sharp slap on the paws from the lever itself. To measure 
the effects of punishment, Skinner obtained extinction curves following pe- 
riodic reinforcement with food from four rats. Of these rats, two were 
slapped at the beginning of the extinction period and two were not. The 
result was that the rats slapped at the outset of extinction showed a much 
lower rate of responding during extinction. As a matter of fact, the punish- 
ment seemed to inhibit the response entirely for a period of time. The 
curious thing, however, is that by the time extinction had been completed, 
the slapped rats had caught up, in total number of responses emitted, with 
the rats not slapped. These results can be clearly seen in Figure 27. 

Skinner interpreted the results of this experiment to mean that punish- 
ment has only a temporary effect upon behavior. It does not affect the “re- 
serve” of responses which the animal will eventually emit; it affects only the 
rate at which this reserve will be exhausted. Early in extinction the punished 
animals show a lower rate of responding, but eventually the rate for these 
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animals accelerates so that they show a total number of responses equal to 
that of the unpunished animals. 

Estes’ experiments. Estes (1944) followed Skinner’s experiment with 
an intensive investigation of the factors affecting punishment of an isolated 
instrumental response—lever pressing in the rat. In these experiments he 
substituted an electric shock for the slap. The shock was delivered through 
the lever, so that rats used to receiving food as the result of lever pressing, 
sometimes received shock instead. 

Estes’ first conclusion is much like Skinner's. The effect of punishment 
is primarily upon the rate of responding rather than on the over-all tendency 
to emit a particular response. Estes did find, however, that if the punish- 
ment were intense enough or lasted long enough, there was a slight 
permanent depression in the reserve of available responses. Nevertheless 
Punishment was never able to eliminate the extinction curve completely— 
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Figure 27. The effects of punishment upon resistance to extinction. The initial effect of punish- 
ment is to reduce the rate of responding. There is, however, recovery, and eventually the total num- 
ished and control animals. (Skinner, 1938.) 


ber of responses is about the same for the pun 


ack to respond after punishment, even though they were 
ngly enough, when Estes punished his rats 
Periodically instead of for every response, the depression of the rate of re- 
sponding was not nearly So severe, though it lasted longer. Thus, the results 
Of periodic, or partial, punishment are somewhat analogous to the results of 
Partial reinforcement. | 
Punishment and conditioned emotional responses. Estes interprets the 
effects of punishment somewhat as follows: anxiety or fear aroused by the 
shock becomes conditioned to the stimuli in the experimental box. This 


anxiety, most strongly conditioned to the visual stimulus of the lever, causes 
’ 


the animal to withdraw. Withdrawal reduces the anxiety and is therefore 


rats always came b 
never again reinforced. Interesti 
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reinforcing. In Estes’ terms, anxiety becomes prepotent over the lever- 
pressing response, which consequently disappears for a time. However, 
since the animal does not press the lever, the anxiety conditioned to the 
lever extinguishes and the animal gradually returns to pressing the bar. Un- 
less the response is again punished, the anxiety gradually diminishes, allow- 
ing the animal to exhaust the reserve of lever-pressing responses it built up 
during food reinforcement. 

Thus, in this experiment, the effectiveness of punishment seems to come 
from an emotional reaction conditioned to the environment in which the 
rewarded instrumental response is likely to occur. The emotional reaction 
prevents, or at least reduces, the tendency for the emission of the instru- 
mental response. When the emotional reaction extinguishes, as it should 
if the animal is not punished again, the tendency to perform the instrumental 
response returns. 

Punishment, then, seems to be a rather special case of avoidance learn- 
ing. This is pointed out by Dinsmoor (1954) and he shows that a fruitful 
analysis of the possible effects of punishment can be obtained by studying 
experiments in avoidance learning. He brings out that the operation of pun- 
ishment is essentially a two-step process as it is in avoidance learning." 

The use of the suppression period produced by punishment. It is clear 
that punishment may not be too effective in completely eliminating some 
behavior. Punishment may momentarily weaken a response, but it does 
nothing to eliminate the behavior permanently. This conclusion seems to 
be in accord with the clinical and anthropological evidence about aggres- 
sive behavior punished by parents or by some agent of society. The tend- 
ency toward aggression is not removed until it can be brought to free 
expression, and thus special psychotherapeutic techniques are frequently de- 
vised for the express purpose of allowing aggression to occur in a displaced 
(therapeutic) situation. 

It is possible, however, to use the temporary period of response inhibi- 
tion produced by punishment to teach an organism a new method of reach- 
ing the same goal. An experiment by Whiting and Mowrer (1943) suggests 
that this may be a valuable technique in the control of behavior. 

These investigators studied ways of getting rats to abandon old, preferred 


2 Hunt and Brady (1955) have compared the effectiveness of punishment and a 
conditioned emotional response in suppressing instrumental behavior. Their results 
show that the conditioned emotional response is more effective, because of the more 
adequate stimulus ‘control it exerts over behavior. 
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adjustments in favor of new ones. They used three groups of rats. All three 
were taught to run a simple maze. After the habit of running to the goal 
box had been acquired, they extinguished it in one group of rats by omitting 
the reinforcement. They continued to reinforce the habit for the second 
group, but placed a barrier in the path leading to food. In the case of the 
third group, they punished the running response by an electric shock. The 
situation was then changed so that all three groups could find the way to 
the goal by an alternative route. The animals that had been punished on the 
original habit showed the fewest errors in reaching the goal by the alterna- 
tive pathway. They also showed less regression to the original habit after 
the alternative habit had been learned. 

Thus, teaching an alternative response during the period of suppression 
induced by punishment could be an effective means of redirecting behavior. 
Of course, as we might expect, if the second habit is extinguished, there will 
be a tendency to revert to the original, punished habit. 

The effects of traumatic punishment. When we were looking at the prob- 
lem of avoidance learning, we saw that Solomon and Wynne (1954) sug- 
gested that sometimes the conditioned emotional reaction set up by painful 
stimulation was extremely resistant to extinction—indeed perhaps never 
could be extinguished. If the same thing applied to punishment, it is pos- 
sible that a punished response would never recover its strength. The experi- 
ments on avoidance learning that Solomon and Wynne were talking about 
were experiments in which very traumatic levels of electric shock had been 
used. In the experiments on punishment we have looked at thus far, the 
punishment was relatively mild. Consequently, it is possible that under a 
really traumatic punishment, the instrumental response punished may never 
recover its original strength. 

This seems to be the case in some studies of Masserman (1943) on pun- 
ishment in cats. As we shall see shortly, the experiments of Masserman 
suggest that the effects of such punishment may go far beyond the instru- 
mental response punished; there may be changes in almost all behavioral 
adjustments of the animals. This is the case in the experimental production 


of neurosis, the problem we shall consider next. 


EXPERIMENTAL NEUROSIS 
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duce disturbances in behavior by experimental conditions in the laboratory. 
In this section we shall look at some of the experiments and their resultant 
theories. 


The Experimental Production of Neurosis in Animals 


Pavlov’s experiments. Some observations by Pavlov (1927; 1928) pro- 
vide the introduction to the technique of producing “experimental neurosis.” 
While studying the conditioned salivary response, workers in Pavlov's lab- 
oratory noticed that animals sometimes developed long-standing disturb- 
ances of behavior. Pavlov began to study these disturbances seriously after 
an experiment in which he attempted to condition salivation by using elec- 
tric shock as a conditioned stimulus. The electric shock seemed to work 
fairly well for a while, but after an attempt at generalization, the conditioned 
salivation was replaced by a violent struggle quite foreign to the usual be- 
havior of the dogs. 

A subsequent experiment led Pavlov to an intensive study of disturbances 
of behavior. This experiment started out as an attempt to test the limits of 
form discrimination in a dog. The animal was trained to salivate to the 
presentation of a luminous circle on a screen. Then an attempt was made 
to get the animal to discriminate between a circle which was reinforced and 
an ellipse which was not. After discrimination was established, the ellipse 
was gradually changed to look more like the circle. For a while, the dog 
continued to discriminate, but then as the two stimuli became more alike, 
discrimination suddenly became very poor. In addition, the whole character 
of the animal’s behavior changed. The dog, which had been trained to stand 
quietly in the apparatus, began to struggle and squeal. Eventually it became 
quite violent and resisted being taken to the experimental room. This be- 
havior is what Pavlov called an experimentally produced neurosis. 

Pavlov believed that the primary cause of this disturbance of behavior 
in the test situation was a conflict in the cerebral cortex between opposed 
forces of excitation and inhibition. The excitation was produced by the 
positive conditioning of one stimulus and the inhibition by the extinction 
of the other. Both conditioning and extinction occur in discrimination, and 
when the stimuli get close to each other, the conflict between the opposing 
forces of excitation and inhibition becomes strong enough to disturb the 
whole cortex. 

Pavlov’s general notions about cerebral physiology have more or less 
fallen into disrepute as we have come to know more about the facts of 
cerebral activity. His interpretation of neurosis in terms of conflict, how- 
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ever, is still influential. If we were to rephrase Pavlov’s theory in language 
less tainted with outdated physiology we might say that experimental neu- 
rosis occurs because of the inability of the animal to tell when to respond 
and when not to. This conflict could be the result of a discrimination that 
is too difficult for the animal. It should be noted in passing, however, that 
such a conflict can arise from causes other than difficult discrimination. If 
the animal must delay a conditioned response too long after the presenta- 
tion of the conditioned stimulus (because of a delay in presentation of the 
unconditioned stimulus) there will be a conflict between the tendency to 
respond and the tendency to inhibit the response until the appropriate time. 

Other investigations of experimental neurosis. Since Pavlov'’s original 
Observations there have been many other attempts to produce disturbances 
in the behavior of animals in the laboratory. Gantt (1936) has closely fol- 
lowed Pavlov’s work, and he has been able to confirm Pavlov’s observa- 
tions on disturbances of behavior during difficult discrimination. 

In studies by Liddell (1944) the conditioned flexion response has been 
used rather than the salivary response. In experiments on sheep, dogs, and 
Pigs, Liddell and his associates have been able to produce many behavioral 
disturbances in the laboratory as well as disturbances in bodily functioning. 
In these experiments there has been great emphasis placed upon the role of 
restraint of the experimental animals during testing. Restraint, monotony 
Of stimuli, and other factors are responsible for the disturbances, according 
to Liddell. It is interesting to note that Liddell finds that many of these 
disturbances persist when the animals are removed from the experimental 
situation. 

Some of the most interesting observations of experimentally produced 
disturbances of behavior come from Masserman (1943). Masserman used 
an instrumental response in producing his disturbances. Cats were trained 
to open a small food box after a stimulus was presented. The techniques 
used to produce the disturbances of behavior were frustration, conflict, and 
Punishment. Suffice it to say that by these techniques Masserman was able 
to produce markedly deviant behavior in his cats. Some cats became 
“phobic” toward the box, particularly after punishment. Other cats went 
into semicataleptic states, refusing to move or even to eat for long periods 
of time. Masserman attempted a variety of therapeutic devices to relieve the 
neurosis after it had been established. Rest was about the most effective 
means of allaying the animals’ symptoms. Also effective were reducing the 
hunger drive through forced feeding outside of the experimental room, “re- 
assurance,” and social imitation of nonneurotic cats. 
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Theories of Experimental Neurosis 


Today there is less emphasis upon trying to find analogies between dis- 
turbed behavior in animals and that in human beings than there was a few 
years ago. Nearly all psychologists realize that a technique used to produce a 
disturbance in an animal need not achieve the same effect in human beings. 
There is less concentration upon techniques which produce disturbed be- 
havior in animals than upon the production of conflict between two habits 
or upon the production of an inhibiting fear in animals. Let us look at some 
of the notions about disturbed behavior in animals, not with the idea of find- 
ing analogies to similar behavior in human beings, but with the idea that 
we might be seeing simple models of some of the processes at work in hu- 
man neuroses. 

Masserman’s theory. Of all of the recent theories of behavioral dis- 
turbances based upon experimental studies of animals, Masserman’s comes 
closest to considering disturbances in animals to be analogous to those in 
human beings. He emphasizes the fact that traumatic situations in the ex- 
perimental tests are responsible for phobic reactions and fixations, much 
as they are supposed to be in the case of human personalities. The symp- 
toms of withdrawal and confusion exhibited by his animals, Masserman be- 
lieves, occur because of the conflict between the motive to get food and the 
anxiety associated with the punishment of the food-getting responses. 

While Masserman does not explicitly point up the relationship, his anal- 
ysis comes very close to the kind we should expect from our earlier dis- 
cussion of basic learning mechanisms in emotional behavior. In addition, 
however, he points out the many parallel points between disturbed personal 
and social adjustments in human beings and what he believes to be their 
counterpart in animal behavior. Though Masserman’s views of the mecha- 
nisms in disturbed behavior do not provide us with a systematic theory, 
they are important in that they represent the first real attempt to bring to- 
gether psychiatric and psychoanalytic theories with the experimental study 
of animal behavior. 

The Dollard and Miller theory. A systematic account of the relation- 
ship between problems of emotional learning in animals and behavior dis- 
orders in human beings is presented by Dollard and Miller in Personality 
and Psychotherapy (1950). These writers are not specifically interested in 
presenting a theory of experimental neurosis, but they are interested in ap- 
plying the basic principles of learning to the description of human per- 
sonality disorders. Though their aim is somewhat different, the general re- 
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sult of bringing together experimental observations of animal behavior and 
observations of human beings in trouble is the same. 

Perhaps the most important principle that Dollard and Miller use is 
approach-avoidance conflict. People frequently find themselves in the un- 
fortunate position of having conditioned fear responses aroused by situa- 
tions which can reduce their positive drive. In other words, there is conflict 
between fear and desires. Dollard and Miller discuss in detail the case his- 
tory of an individual who suffered from conflict between the fears instilled 
by a repressed childhood and her normal, adult sexual desires. Since the fear 
actually prevented this woman from satisfactorily satisfying her positive 
needs, misery and anxiety arose. Furthermore the fear produced repression 
of fear-arousing thoughts and images so that the fear itself was prevented 
from undergoing extinction. The net result was a vicious circle in which no 
opportunity was given for the reduction of fear or the elimination of con- 
flict. This, say Dollard and Miller, is the general scheme for disturbances 
in personality and the production of psychosomatic symptoms. 

Dollard and Miller also apply the principles of learning to the process 
Of readjustment of maladjusted people. They present an account of the course 
Of psychotherapy in one individual and point out how each aspect of psycho- 
therapy works in terms of the principles of learning. They show, for exam- 
Ple, how talking about a problem in a permissive and relaxed atmosphere 
could result in extinction of the fear responses and consequent resolution of 
a conflict. They likewise interpret the transference that takes place during 
therapy as a special case of stimulus generalization, and show how dis- 
crimination learning must take place if the individual is to distinguish be- 
tween appropriate and inappropriate behavior in various life situations. 

Maier’s theory. The position advanced by Dollard and Miller really 
needs no new principles to describe the mechanisms in disturbed behavior; 
the learning principles discussed thus far in this book are perfectly adequate. 
We should, however, look briefly at one theory of experimental neurosis 
that does invoke a new principle. This is the theory presented by Maier 
(1949). In experiments by Maier and his associates, rats are required to 
discriminate between two visual stimuli presented on the Lashley jumping 
stand. The stimuli are reinforced and punished in a random manner, so that 
the rats really cannot learn or find a solution to the problem. Animals fre- 
quently refuse to choose in this situation, even though they may be very 
hungry. Thus it is sometimes necessary to “drive” the animals to choose by 
Stimulating them with a blast of air (which emits a piercing sound painful 
to rats) or an electric shock. When this happens and the rats are forced to 
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respond, they usually fixate on a particular response—jumping to the left 
card or jumping squarely at the middle, missing both cards. Once this re- 
sponse becomes fixed, it never changes. This Maier has called “abnormai 
fixation.” 

The animals in these experiments, Maier says, are frustrated, and this is 
responsible for the fixation. Animals, he finds, that have developed a fixa- 
tion may even refuse to jump to an open window in which food is displayed. 
Rather, they prefer their “fixated” response. Maier believes, moreover, 
that his experiments show that behavior elicited during a state of frustration 
has certain unique properties. Frustrated behavior is not motivated behav- 
ior in the usual sense; it solves no problems and has no goals. Aggression, 
regression, and fixation are all characteristics of frustrated behavior. Since 
behavioral disturbances can be described in terms of aggression, regression, 
and fixation, Maier believes that a major portion of the disturbances in be- 
havior reported in studies on experimental neurosis are due to frustration. 

The strength of the fixations reported in experiments on the Lashley 
jumping stand is remarkable. Making available the solutions to the problem 
does not break up the fixation (Maier, 1949), and even training the rats 
to walk to the correct stimuli does not eliminate it (Feldman, 1953). Strong 
Punishment, persistently administered, however, does seem to break the po- 
sition stereotypes of fixations (Knopfelmacher, 1953). 

Thus Maier introduces a new concept, frustration induced fixation, and 
he, as well as others, has produced a good deal of evidence to support this 
notion. It is an interesting one, and only time and new evidence will tell us 
whether such a concept is necessary or whether all the things it purports 
to explain can be clarified by the ordinary principles of learning. Perhaps 
the greatest importance of the notion, so far as we are concerned at the mo- 
ment, is that it suggests that the laws of adaptive behavior break down under 
the stress of frustration. 

Some comments on experimental neurosis. We can see, from this brief 
survey, that many basic principles in the Psychology of learning have been 
put to work answering questions about the causes of disturbed behavior. 
We have also seen that experimental work on animals sometimes leads us 
to formulate new principles which, in turn, may be applied to problems of 
disordered behavior in human beings. 

It should be pointed out that there is a vast gulf between the disturbed 
behavior of animals in the laboratory and the disturbed behavior of indi- 
viduals in trouble. In most studies of experimental neurosis, disruption of 
behavior has been produced by stimuli that elicit strong emotional re- 


EMOTION AND LEARNING 147 


sponses. These disturbances, as well as the emotional behavior, are situa- 
tional. The animals may be disturbed in the laboratory but they are less 
or not at all disturbed when out of the laboratory. Human neuroses are not 
situational. Thus, in the matter of behavioral disturbances the rat has the 
advantage over the human being. The rat does not carry its anxieties around; 
rather its anxieties are conditioned to specific stimuli, and it shows signs of 
anxiety only in the presence of these or similar stimuli. The human being— 
through the mediation of verbal behavior—carries his anxieties about and 
uses implicit verbal behavior to reinforce them. Thus, perhaps the major 
stumbling block to a direct analogy between disturbed behavior in labora- 
tory animals and in human neurotics is the low capacity animals have for 
engaging in symbolic and linguistic behavior. Human beings can much more 
readily generalize and extend their fears and conflicts because they can 
symbolically face fear of punishment or conflict even when the objective 
situation does not demand it. This fact and others make us aware of the 
enormous gulf that exists between experimental neuroses of the laboratory 
and disturbed personalities. We should not ignore, however, the extent to 
which animal behavior in the experimental laboratory can provide us with 
models of basic mechanisms in personality maladjustments. The extraordi- 
nary degree to which the results of simple experiments can be used to 
describe personality processes is well illustrated in Dollard and Miller’s pre- 
viously mentioned Personality and Psychotherapy (1950). Nor have Dol- 
lard and Miller stood alone in this endeavor; Mowrer (1950) and others 
have explored much the same course, and our understanding of basic mech- 
anisms in the dynamics of individual personality has been enriched by 
their work. 
STRESS AND LEARNING 


Before we leave the topic of emotion we have one important problem to 
consider, the effects of emotional activity upon learning and the perform- 
ance of learned skills. Emotional activity arises when the organism is under 
Some stress. Examples of stress are familiar enough—the navigator of a 
modern bomber might be troubled by the threat of physical injury and the 
need for quick, precise performance of his task to ensure the success of his 


mission. 


Emotion and Motivation in Stress 
It is obvious that stress involves components other than emotional activ- 
ity. It is a state in which some motive or motives remain ungratified; this 
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may mean failure to avoid or escape threat (fear) or it may even mean 
lack of satisfaction of a powerful appetitive desire (sex, for example). Thus 
stress means heightened motivation as well as heightened emotional activ- 
ity. What is more, in experimental studies of human behavior, it is prac- 
tically impossible to produce emotional disturbance without also producing 
a change in motivation (Lazarus, Deese, and Osler, 1952). Thus, we can- 
not examine the effects of emotional arousal upon learning and skilled per- 
formance without taking into consideration the effects of change in motiva- 
tion. This point was raised in the last chapter in the section on experimental 
studies of human motivation. A number of studies were reviewed in which 
investigators had tried to relate individual differences in motivation level 
to learning and performance on various tasks. The difficulty with these 
studies, remember, was lack of agreement as to whether or not the measures 
of individual differences used by these investigators actually assessed moti- 
vation. Some workers point out that these studies really were concerned 
with the relationship between performance and individual differences in 
emotionality and performance. 

Unfortunately there is no way out of this dilemma at present. Perhaps, 
from a strictly operational point of view, we have no justification for talking 
of the effects of emotion and motivation as being independent of each 
other, at least as far as human skilled performance is concerned. There is 
evidence from other sources, however, that emotional activity and motive 
states do derive from somewhat independent sources, and given the present 
general viewpoint in psychology, we find it convenient to talk about emo- 
tion and motivation separately. 

Roughly stated, level of motivation is responsible for level of perform- 
ance. In other words, higher motivation leads to more adequate performance 
if the latter leads to eventual satisfaction of the motive. Emotional arousal 
provides activity which for all but a small number of skilled acts is likely 
to produce interference. Trembling, gastrointestinal upset, etc., are all 
things that may distract from or directly interfere with some particular task 
an individual is trying to perform. Consequently, in the vast majority of 
specific instances, we should suppose that emotional arousal would have a 
detrimental effect upon skilled performance. 

What, then, of the effects of stress? Stress, as we saw, involves both mo- 
tive and emotional components. One very possibly would bring about im- 
proved performance, and the other, deteriorated performance. What actually 
happens when individuals are stressed in an experimental test and the effects 
of this situation upon performance evaluated? 
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Experimental Studies of Stress and Performance 


As the confusing roles of emotion and motivation in stress would lead 
us to imagine, there is no simple way of stating unequivocally that stress 
will have specific effects upon learning and the performance of particular 
skilled acts. There are few stable generalizations that can be made. 

One which seems to receive support from a wide variety of experimental 
investigation is that stress increases the speed with which repetitive tasks 
are carried out (Lindsley, 1946; McKinney, et al., 1951). This increase in 
speed is generally accompanied by a higher rate of errors. The net result 
is that some people actually improve their performance because of the 
greater speed, and others decrease in performance because of an increase 
in errors (Lazarus and Eriksen, 1952). 

Furthermore, we suspect that individual reactions to stress are de- 
termined in part by stable personality patterns. In the last chapter we dis- 
cussed some studies of the relationship between performance on skilled 
tasks and scores on personality inventories. We saw that these personality 
inventories probably assess individual differences in motivational as well as 
emotional factors. Therefore, a thoroughgoing exploration of the relation- 
Ship between personality inventory scores and behavior under stress ought 
to provide us with some interesting and analytic data about stress and per- 
sonality. A bare beginning to this study has been made (Deese, Lazarus, 
and Keenan, 1953; Osler, 1954). Perhaps studies of personality charac- 
teristics and behavior under stress will provide a means of separating the 
motivational and emotional components in the reaction to stress. 

There is tentative evidence that poorly organized habits are more suscepti- 
m stress than well-established habits. Information 
are little influenced by stress, but 


ble to deterioration fro 
items on an intelligence test, for example, 
the immediate memory span (memory for things heard or seen just once) 
is susceptible to stress (Moldawsky and Moldawsky, 1952). Also, it seems 
likely that performance early in the learning of a new skill is influenced 
more by stress than is later performance (Deese and Lazarus, 1952). 
Beyond these simple statements it is difficult to go. It is likely that a more 
thorough analysis of the effects of stress upon learning and performance 
will depend upon new experimental techniques and new ways of assessing 
individual differences in emotional reactivity and motivational level. 


CHAPTER 7 


SERIAL LEARNING 


A basic fact of behavior is that it is sequential; acts follow one another in 
a continuous stream. One of the central problems for the psychology of 
learning is to consider how this stream of behavior becomes organized. As 
Wwe try to solve this problem we shall draw upon many basic principles al- 
ready discussed, as well as upon new principles and ways of studying be- 
havior. Therefore, this chapter is about the possible applications of the 
basic principles of instrumental learning to particular facts of serial or 
sequential patterns of behavior, and the new techniques and methods used 
in describing such patterns. 

Many of the examples of learning we looked at in the earlier chapters 
involved sequences of responses rather than single responses. We treated 
these examples, however, as if they were single responses. In the Skinner- 
box experiments, for example, the learned behavior was characterized as a 
“lever-pressing” response, and we treated it as if only one act were necessary 
to carry out the learned instrumental response. Actually, there is an elabo- 
rate chain of responses involved in pressing the lever, a fact pointed out by 
Skinner (1938). The rat must see the lever, and this may lead to lifting the 
paws; it cannot begin the actual depression movement until it feels the 
lever; it must then seize and eat the food. Skinner described at least four 
separate responses as part of the chain involved in pressing the lever, and 
it is probable that a really detailed analysis would yield more. Certainly 
parts of the chain change as learning progresses (Hurwitz, 1954). 

We treated the instrumental activity in the Skinner box, however, as a 
single response, since we were not interested in the parts of the chain of 
behavior and how they became organized. The “response” was defined as 


something which produced an effect upon the environment—moving the 
150 
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lever through a certain excursion. After so defining the response, it was pos- 
sible to study the effects of different variables and to treat these as if they 
were affecting only one response rather than a whole chain of responses. 
Before we can even begin to appreciate the full range of problems in learn- 
ing, however, we must examine the available evidence on the organization 
of response sequences—how responses interact with one another when 
they occur in the stream of behavior. 

In one sense, the larger portion of the remainder of this book is devoted 
to the problem of the organization of response sequences, since interactions 
between responses provide the basis for much of forgetting, problem solving, 
and transfer of training. Therefore, as we consider each of these problems 
separately, we shall also be examining certain special effects of response 
sequences. 


SERIAL LEARNING IN ANIMALS 


Let us first look at the problems of serial learning in animals. Since ani- 
mal behavior is less complicated than human behavior and since animals 
have little available to them in the way of symbolic processes or linguistic 
behavior, we shall find applying the basic principles of behavior to serial 
learning in animals relatively simple. Later, when we come to deal with 
human serial learning, we shall examine the new methods and principles. 


Techniques in Studying Serial Learning in Animals 


Everybody knows that psychologists spend a good deal of time teaching 
White rats how to run mazes. This is partly because the maze provides a 
good technique for the comparative study of animal behavior, but also 
because the maze is an important source of information about serial learn- 
ing in animals. There are, of course, other techniques for the study of serial 
Processes in animals, but since the maze is the most widely used and im- 
Portant technique, let us look at it first. 

The maze. The maze is the tool par excellence for the study of the be- 
havior of rats. The rat’s natural habitat is a mazelike series of burrows, and 
it learns mazes efficiently and well. Consequently, most of the studies of 
maze learning have used rats as subjects. 

There has been an enormous variety of mazes used in experimental 
studies. Figure 28 shows the floor plan for one maze. The rat is placed in 
the box marked S, and it must make its way to the reinforcement in the 
box marked G. At each of the choice points, the rat must choose between a 
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blind alley and the path that will eventually bring it to the goal. An error 
is counted when a rat enters one of the blind alleys. In this particular maze 
the correct sequence of turns is a relatively simple LRLR. The maze can 
be lengthened or shortened by additional choice points, and instead of hav- 
ing these arranged for systematic alternation, they can be arranged ran- 
domly. 

Other mazes can be more haphazardly made, so that, for example, the 
distance between successive choice points is not the same. The famous 


Figure 28. The floor plan of a multiple-choice—point maze. The number of choices is probably 


smaller than would be used in practice. Each entry into a blind alley would contribute one error 
to a trial. 


Hampton Court maze is of this sort. Still other mazes will have more than 
one blind alley at a choice point. Finally, 


some have alleys leading away 
from the choice point 


at different angles; a right angle is typical (as in a 
multiple T maze), but more acute angles are frequently used. 

The usual animal maze is an enclosed alley. In some ex 
ever, the animals 


of course, 


periments, how- 
are allowed to run on open platforms. In enclosed mazes, 


animals cannot see more than one choice Point ahead, but in an 


elevated one, it is possible for the animal to obtain some overview of the 
maze pattern. 


There is obviously a wide Variety of mazes, each one Suitable for a differ- 
ent problem. One simple maze, the linear maze, has been important in 
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theoretical studies of learning. Figure 29 shows an example of a linear 
maze. It is just a long alley which has been divided into three or four com- 
partments. There are anywhere from two to four doors between each com- 
partment. Only one of the doors will swing open; the others will be locked 
shut. The rat must learn which door is free to open and to run to it. An 
important feature of the linear maze is that the choice points closely re- 
semble one another; consequently, stimulus generalization from one choice 
point to another is likely to be much the same. 

Other techniques. Instead of presenting a maze in which the animal must 
run, it is perfectly feasible to provide a sequence of other kinds of instru- 
mental activity. One experimenter (Arnold, 1947), for example, presented 
rats with a series of buttons which had to be pushed in an invariant order; the 
measure of learning in this case was taken from reaction times, or latencies. 

Frequently a Skinner box is modified so that the rat is presented with 
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Figure 29. A linear maze. There are four choice points with two choices at each. There is a 
“centering” door between each choice point to prevent the favoring of one door over the other by 
Virtue of the previous correct response. 


two levers instead of one. In some cases the rat must press the two levers 
in some particular order—two presses on the right lever and then two 
presses on the left lever (Schlosberg and Katz, 1943). In other cases the 
Object is just to see how the rat distributes its choices between two levers. 
In this connection, some experimenters have studied how the rat distributes 
its choices between pressing the lever and going to the food tray by putting 
the food tray on the opposite side of the box from the lever (Frick, 1953). 

The particular techniques for studying serial learning will vary with the 
problem being studied and the organism being tested. The maze is the 
instrument of choice with the white rat, because this animal is capable of 
learning an extended series of reactions in the maze. With other organisms, 
other techniques are sometimes preferred, and if one wishes to study the 
natural sequences of behavior, the maze has limited usefulness since it 
forces the animal to perform its choice reactions in an invariant sequence. 

Let us first consider the events that occur when an animal is forced to 
learn in the unchanging pattern imposed by a maze. 
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Theories of Maze Learning 


The original investigations of maze learning (Small, 1899, 1900) were 
made with the idea that the rat does not learn a series of movements but, 
rather, locations in the maze. This common-sense notion that the rat learns 
to recognize places has always been held by some investigators of animal 
behavior, but, for a time at least, the predominant view seemed to be that 
Tats learned sequences of movements. Today, we think it rather unim- 
portant whether the rat learns sequences of movements or locations in the 
maze, and we suspect that actually both kinds of learning occur (Blodgett 
and McCutchan, 1947). Historically, however, many important ideas were 
allied with the notion that rats learn sequences of responses in the maze, 
50 let us see how this hypothesis arose. 

Watson's theory of maze learning. Perhaps the earliest attempt to ac- 
count for the facts of maze learning in a systematic way is that of J. B. 
‘Watson. Watson was one of the great pioneers in the study of learning and 
one of the most important of the early behavior theorists. 

Watson assumed that an animal learning to run a maze is learning a se- 
quence of movements. After Pavlov’s experiments on the conditioned 
Salivary response became generally available in this country, Watson as- 
sumed that each of the movements in the sequence was much like a Pav- 
Jovian conditioned response. (Notice that he Was applying the principles 
of classical conditioning to instrumental learning.) Thus, 
basic process involved in habit was that by which a neutr 
to elicit a response when paired with a stimulus which alre 
to elicit the response. Watson’s Position in this matter wa 
fundamental that we can do no better than look at a 
second edition of his famous textbook (Watson, 192. 


Watson said, the 
al stimulus comes 
ady has the power 
S so important and 
quotation from the 
4, pages 293-294). 


Certainly at birth or shortly thereafter the ele 
every habit is formed can be noted. We mention 
the fingers, or of the lower and upper arm, 
rotation of the head, bending the trunk from side to side, the backward and for- 
ward, well-systematized movements of the legs, and a host of others. The conclu- 
sion is forced upon us that in habit no new elementary movements are needed. 
There are enough present at birth and more than will ever be combined into 
complex unitary acts. . . . One needs Only to examine the five or Six day old 
infant to be reasonably convinced that there is no need for the formation of 
additional reflex arcs to account for all later Organization. The new learned ele- 
ment in habit is the tying together or integration of Separate movements in such 


ments or units out of which 
the contraction and flexion of 
raising and lowering of the hand, 
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a way as to produce a new unitary activity. And by unitary activity we mean 
nothing more than the everyday acts of life, such as reaching out the hand for an 
object that stimulates the eye, picking the object up and carrying it to the mouth 
Or laying it on the table. . . . 

We can define habit then as we did instinct as a complex system of reflexes 
which functions in a serial order when the child or adult is confronted by the 


appropriate stimulus. 


Watson applied these general notions about the development of complex 
acts to the analysis of maze learning in rats—an example of how a whole 
series of discrete movements becomes organized into a unitary act. In a 
complicated maze, Watson thought, the rat must learn a series of responses; 
for example: run 5 feet, turn left, run 3 feet, turn left again, etc. He thought 
of the animal which had learned the maze correctly as having synthesized 
a whole chain of conditioned responses. The ingenious feature of Watson's 
notion was that the conditioned stimulus for each phase of the sequence of 
acts came from the preceding act. The conditioned stimuli in maze learn- 
ing stemmed primarily from the kinesthetic stimuli of the movements them- 
selves. Thus, the “feel” of each movement was the conditioned cue for the 
next movement. 

Watson (1907) thought he had evidence to show that the main sensa- 
tions rats used in learning mazes were kinesthetic. Unfortunately, it turned 
Out that Watson’s experiments were faulty and that kinesthetic cues are not 
necessarily the most important ones for rats in maze learning (see Munn, 
1950). Over and above this fundamental error, there were several other 
critical features against Watson’s notion of maze learning as a simple chain- 
ing of conditioned responses. 

If Watson’s basic idea is correct (aside from whether the conditioned 
stimuli are kinesthetic or not), the whole chain of responses ought to be 
Very mechanical; each response should be exactly the same each time and 
follow exactly in its proper sequence. Experimental support for this view was 
offered by Watson (Carr and Watson, 1908). Rats were trained in a maze 
in which the alleys could be lengthened or shortened without changing the 
pattern of turns which the animal had to make. Carr and Watson noticed 
that animals trained on short alleys (a turn required after a relatively short 
Straightway) attempted to turn at the same place when the alley was length- 
ened. Thus it looked as though the animals had learned a sequence of move- 
ments in which the nth movement was a turn. Apparently they had learned 
to turn right not when the first opening presented itself but after making a 


certain number of movements. 
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Such observations supported the notion of kinesthetic chaining as the 
basis of maze learning. Subsequent experimental work has shown, how- 
ever, that this cannot be the only, much less the principal, way in which rats 
learn mazes. The number of experimental refutations of this notion is large, 
but perhaps the simplest is provided by Macfarlane (1930). He trained rats 
to run a maze in which the passages contained water to the depth of 8 
inches. The rats were thus forced to swim the maze. He then switched them 
to walking by putting in a false floor slightly below the surface of the water. 
Since there was no increase in errors when the animals were transferred 
from swimming to running, the learned maze pattern did not consist entirely 
of a chain of conditioned movements. 

Watson believed that the most important condition determining learning 
and retention of the correct pathway in mazes was that correct responses were 
generally more frequent and also more recent than incorrect responses 
(Watson, 1914). Watson was not able, however, to account for many of the 
Special features of maze learning with these notions of frequency and re- 
cency, and the result has been that more modern theories have tended to 
abandon them in their simple form. 

Hull's analysis of maze learning. Hull, like Watson, has placed great 
emphasis upon the conditioned response. He also indiscriminately applies 
the same principles to classical and instrumental conditioning. Unlike Wat- 
son, however, Hull does not rely exclusively upon movement-produced con- 
ditioned stimuli. The result is that Hull has provided us with a theoretical 
analysis of maze learning that, while defective in ce 
been the most powerful analysis made to date. 


Hull (1930) differs from Watson in that he 
them 


rtain particulars, has 


allowed external cues from 
aze to be conditioned to each response. The visual stimuli at a choice 


point, he said, provide the cues for the response of turning to the right. He 
conceived of this process as being like classical conditioning, but it does 
not do violence to his notions to describe turning as instrumental 
brought under the control of a particular discrimina 
ential reinforcement. Hull follows Watson to the 
kinesthetic cues can provide part of the chain th 
proper order. Moreover, since animals are usually motivated during runs 
through the maze, he said that there is a drive stimulus which is conditioned 
to every response the animal makes (Hull, 1930, 1931); in other words, 


since the animal is equally hungry at every point in the maze, the stimuli 
from drive are conditioned to every part of the maze. 


activity 
tive stimulus by differ- 
extent of assuming that 
at keeps responses in their 
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An essential element in maze learning, according to Hull (1932) is the 
gradient of reinforcement (Chapter 2). Hull calls this the goal gradient, 
and it refers to the gradient of the effectiveness of reinforcement through the 
maze. In general, it states that correct responses close to the goal will be 
reinforced sooner (and hence performed more adequately) than correct 
responses distant from the goal. Hull (1943) largely attributes this goal 
gradient to a diminishing effect of secondary reinforcement as the animal 
gets further from the goal. The goal gradient, of course, is a temporal gradi- 
ent; it functions as a spatial gradient in the maze only because of the invariant 
Order of choice points in the maze. 

One thing the concept of the goal gradient allows Hull to do is deduce 
that the maze will be learned. This is no mean accomplishment; one of the 
real deficiencies of Watson’s notions of frequency and recency is that they 
did not unequivocally predict that the rat will learn the maze, a fact pointed 
out by Thorndike (1915). However, it is clear that any time an animal 
makes an error, the reinforcement of that error will be further removed in 
time than the reinforcement of the correct response at the same choice 
Point, for if an animal enters a blind alley it takes it some time to get out. 
The net result is that correct responses will be more strongly reinforced 
than incorrect ones. Another deduction from the goal gradient is that if 
the rat is allowed access to two paths to the goal box, eventually it will 
come to prefer the shorter path. 

A third prediction, one that has not always met with success, is that blind 
alleys near the goal will be eliminated before blind alleys at the beginning 
Of the maze. Spence (1932) has shown that this is generally true, but there 
are many specific cases in which the backward elimination of errors is ob- 
scured by other factors, such as the direction (relative to the goal) in which 
a particular blind is pointing. 

There is one other mechanism postulated by Hull that is important in the 
analysis of maze learning. This is the fractional anticipatory goal response. 
We have already seen that Hull suggested that a persisting stimulus, prob- 
ably arising from the drive state, is conditioned to every response in the 
maze. It is most strongly conditioned to the goal response, that is, to the 
response of eating food. It is possible, however, that this persisting stimulus 
May prematurely elicit the goal response at some point earlier in the maze. 
This fractional anticipatory goal response allows Hull greater freedom in 
accounting for the data of maze learning. He can, for example, use this 
notion in deriving the prediction that animals learn different pathways in 
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the same maze depending upon whether they are hungry or thirsty (1952). 
Also, this notion leads to the correct prediction that when maze behavior 
is extinguished, it does not extinguish serially as one might expect, but 
seems to disintegrate all at once. 

The fractional anticipatory goal reaction is much more of an ad hoc 
notion than that of the goal gradient, and sometimes Hull and his students 
have used it too freely in explaining the results of experiments. Hull uses 
this concept to describe much activity which other investigators have usually 
named anticipatory or cognitive. 

Studies on the linear maze. Hull and his students have given a good 
deal of attention to the study of behavior in linear mazes, or more generally, 
to situations in which a series of identical or nearly identical responses are 
required. The linear maze, as we described it earlier, is one in which all 
choice points look exactly alike. These points usually consist of a number 
of doors, only one of which will Open into the next compartment of the 
maze. 

Since the choice points in a linear maze look so much alike, there is a 
great deal of stimulus generalization from one to another. Because some of 
the choice points are closer together than others—choice point II is closer 
to II than to IV— some will generalize more to one than to others. Thus, 
if the animal learns to go through the second door at choice point I, it will 
generalize this tendency rather strongly to II, somewhat less to II, and 
much less to IV. Likewise, if the third door is correct on choice point IV, 
the animal will strongly generalize the tendency to go to the third door on 
choice point IIT; it will generalize this tendenc: 
very little on I. 

We can easily see that these tendencies, b 
to two kinds of errors at any given choice 
those which are incorrect at the choice poin 
rect later on. Thus the rat may choose door 
door 2, but 1 is correct at the next choice P 
anticipated the second choice point. Likewis 
sponses which are incorrect where they oc 
some earlier choice point. 


y a little less on point II and 


ased upon Eeneralization, lead 
point. Anticipatory errors are 
t at which they occur but cor- 
1 when it should have chosen 
Oint. In this case, the rat has 
€, perseverative errors are re- 
cur but which were correct at 


The effect of anticipatory and Perseverative errors has been studied in 
the linear maze under a wide Variety of conditions. In some experiments 
the correct door at each choice is different. In other experiments the same 
door is correct at every choice point. In some experiments the animals were 
reinforced only at the end of the maze; this Procedure allows the familiar goal 
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gradient to operate. In other experiments, however, the animals were rein- 
forced after each choice in the series; this produces a radically different 
gradient of reinforcement. 

The results of all of these experiments have been compared with the 
theoretical deductions derived from Hull’s general theory of learning. In 
most cases the experimental results and the theory agree. We can see very 
simply and intuitively how the results of these experiments could support 
the theory. 

It is easy to see, for example, that when a different door is correct at each 
choice point, the anticipatory and perseverative tendencies would produce 
nothing but errors. Moreover, since the middle choice points are influenced 
by both ends, the greatest number of errors during learning would be in 
the center of the maze. This is actually the case in experimental results with 
the linear maze (Hull, 1947). If, when a different door is correct at each 
choice point, there is reinforcement only at the end of the maze, there are 
fewer errors at the end of the maze than at the beginning. If each choice is 
reinforced, however, there is much less difference in errors between the two 
end ones, and in actual fact, there are fewer errors at the beginning than 
the end (Hull, 1948). 

If the same door is correct at every choice point, anticipation and per- 
Severation are beneficial to learning, for however the animal anticipates or 
Perseverates at any choice point, it still makes the correct choice. Since 
Perseveration cannot occur at the first choice point and anticipation cannot 
Occur at the last one, anticipation and perseveration are greatest in the 
middle; therefore, there are fewer errors in the middle (Sprow, 1947; 
Gladstone, 1948). 

In addition, we have some experiments (Smith, 1949; Woodbury, 1950) 
that show cases in which the choice points are neither all the same nor all 
different. In these experiments, the point at which the maximal number of 
errors occurred was in accord with the hypotheses of Hull. For example, 
if the first three choice points are all the same, but the last one different, 
then the greatest number of errors occurs at the next to the last choice point, 
Since the animal anticipates the last choice point most strongly here. 

Remember that anticipation and perseveration are special cases of 
stimulus generalization. The rat in the linear maze, when faced with a choice 
Point, will not always be able to tell which compartment it is in. This is a 
failure of discrimination and is therefore stimulus generalization. Hence, 
these studies on the linear maze tell us that the principles derived from a 
study of single instrumental and classical conditioned responses are useful 
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in analyzing a relatively complex form of behavior. These studies show the 
way in which generalization of stimuli associated with particular responses 
may interact to produce a complex pattern of behavior. 

Other serial problems. The analysis of maze learning presented by Huli 
and his associates has been generalized to all sorts of problems. Hull (1952) 
treats in detail sequences of responses in a modified Skinner box and a 
special problem, double-alternation. The important thing about his treat- 
ment is that essentially the same rules apply to these cases as apply to maze 
learning. In other words, it is necessary to consider generalization from one 
response to the next, gradients of reinforcement, and all the other mecha- 
nisms of interaction between responses. The way in which these factors 
combine to produce interaction between responses, of course, depends upon 
the particular problem. The cases which Hull has considered have nearly 
always been simplified so that the interaction between the Various factors 
is not too complicated. For example, in the linear maze all choice points are 
alike so that the same amount of stimulus generalization goes from one 
choice point to the next. 

There are, of course, many peculiar features in particular serial acts such 
as maze performance, that cannot be accounted for by mechanisms we have 
been describing thus far. Many of these are peculiar to the rat or even to 
the rat in particular mazes. Consequently, they are not of much general 
interest to the psychology of learning. There is a thorough discussion of 
factors limited to specific cases of maze learning in Munn’s Handbook of 


Psychological Research on the Rat (1950); some of these factors are im- 
portant to the comparative study of behavior. 


Statistical Sequences of Behavior 


In the maze, animals must perform responses in a more or less invariant 
sequence—they must first make a choice at the first choice point, then at 
the second, etc. In recent years, however, Psychologists have begun to study 
natural behavior sequences in which animals can emit responses in a 
variety of orders. In doing so, they have examined in greater detail some 
examples that we had come to think of as invariant sequences of responses. 
For instance, we have treated behavior in the Skinner box as if it were much 
more uniform than it really is. We have more or less assumed that each 
time the rat presses the lever it immediately turns to the food tray for the 
reinforcement. This is by no means true. Even when the food tray is placed 
directly under the lever, the rat will Occasionally press the lever several 
times before eating the food. This may even be the case when the pellet 
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dropped must be eaten before further pellets are released. Interesting 
problems arise when we deal with these naturally occurring sequences of 
behavior, and it will be worth our while to examine them in some detail. 

Statistical behavioristics. First of all, we need a method for analyzing 
the sequential interaction of responses. Such a method is provided by what 
has been called statistical behavioristics (Miller and Frick, 1949). The 
essence of this method involves computing the probability of a particular 
response occurring after some other response has been emitted. We may 
ask, for example, what is the probability of turning to the food tray after 
lever pressing? Obviously, it is high, though not perfect, because sometimes 
a second lever press will follow the first. What is the probability of return- 
ing to the food tray after eating? It is obviously low, for the rat, rather than 
return to the empty tray, will press the lever again. Thus, it is clear that the 
Probability of turning to the food tray does not stand by itself, but depends 
upon what the organism did just before. Thus, we may speak of the de- 
pendent probability of a response, and this, in general, means that the 
Probability of a particular response depends upon what preceded that 
response. 

By analyzing the dependent probabilities in sequences of behavior we 
Can discover many things. We can, for example, compute an index of stere- 
otypy, which tells us how regular and repetitive patterns of behavior are. 
Let us look at an example of the application of conditional, or dependent, 
Probability analysis to a problem in trial-and-error, or multiple-response, 
behavior. 

Dependent probabilities and trial-and-error behavior. To show how an 
analysis of dependent or conditional probabilities applies to sequences of 
responses in trial-and-error behavior, Miller and Frick (1949) have com- 
pared the behavior of a rat and a child in a multiple-choice experiment. The 
Problem for the rat and the child was to get out of an enclosure with four 
doors. Since the door which permitted an exit varied from trial to trial, it 
Was useless for the learner to attempt to try again the door which had been 
Correct on the previous trial. The original purpose of this experiment, which 
is a part of an older study (Hamilton, 1916), was to get an extended sample 
Of trial-and-error behavior in an insoluble problem. 

Miller and Frick point out that if one attempts to predict which of four 
Possible doors the child will pick in any one trial, the best guess is chance— 
unless information is available on the relative frequency of previous 
responses and on the response immediately preceding the response in ques- 
tion. Once we have this information, however, we can increase our predic- 
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tion over chance. If, for example, a sequence of two responses in the child 
is considered, we find that of the 16 possible sequences (four doors times 
four doors), 6 sequences appear 80 per cent of the time. Thus a knowledge 
of the choice just before the response greatly increases the accuracy of our 
prediction. 

When Miller and Frick analyzed the data involved in sequences of three 
responses, they found great regularity in the third choice. It was nearly 
always a response which was different from the preceding two. Thus, if the 
child chose doors 1 and 2, there was a very low probability that either doors 
1 or 2 would be chosen as the next response. Miller and Frick point out 
that this is one respect in which the child differs markedly from the rat. 
The rat was much more likely to go back to one of the doors chosen on the 


Preceding two responses. This suggests a longer memory span for the child 
than for the rat. 


Dependent probabilities in the Skinner box. A few paragraphs back we 
said that the sequence of lever Pressing, then turning to the food tray, and 


then back to the lever was not inevitable in the Skinner box. Sometimes, 
animals press the lever two or three times before 


detailed analysis of sequence in the Skinner bo. 
and Miller, 


going to the food tray. A 


X has been made (Frick 
1951) by altering the box as shown in Figure 30. The food 
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Figure 30. The floor plan of a Skinner box used to study sequences of lever Pressing and eating. 
The lever and the food tray are on opposite sides of the box so that a clear-cut count of both re- 
sponses can be made. Approaches to the food tray are counted by a photoelectric cell. (From Frick 
and Miller, 1951.) 
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tray is placed on the opposite side of the box from the lever; this makes 
the rat go back and forth across the box. The investigators tabulated the 
number of times the rat pressed the bar and the number of times it ap- 
proached the food tray, as well as the order in which these responses 
occurred. Thus if A means approach to the lever and B means approach 
to the food tray, a typical sequence of responses might be as follows: 
AAAABBBBBBAAAABBAAAAAAABABBBAAA, etc. 

These investigators studied the pattern of responses both before and after 
conditioning. Surprisingly enough, behavior was more stereotyped before 
conditioning than after. Stereotypy in this case usually meant repeating the 
same response in runs. In other words, if the animal approached the food 
tray (empty) before conditioning, it was more likely to repeat this act than 
it was to turn around and approach the lever. After conditioning, however, 
this pattern was broken. There was a tendency to alternate the responses 
(as perhaps the animals should) but this was less regular than the repetitive 
Pattern before conditioning. 

In another study, Frick (1953) applied this method of analyzing be- 
havioral sequences to the problem of the gross effects of anxiety or fear 
On behavior. In this case, the patterns of A’s and B’s were studied by the 
Autocorrelation function. The autocorrelation function tells to what extent 
later responses are dependent upon earlier responses and how many re- 
sponses back this influence may extend. The results of Frick’s study showed 
that a conditioned anxiety response induced by electric shock reduced the 
Patterning in the lever-pressing—-food-tray sequence without affecting the 
over-all rate of responding. 

These studies represent initial steps in the study of the temporal organiza- 
tion of behavior. It is probable that we shall see many new applications of 
the basic technique of studying dependent probabilities in the near future. 
As we shall now see, these same techniques have been applied to problems 


in human behavior. 


HUMAN VERBAL LEARNING 


The most important and obvious problems in the serial learning of human 
beings occur in verbal behavior. Our verbal behavior consists of chains of 
Words and phrases emitted in particular orders. These are clearly important, 
since the words do not readily make sense in a scrambled order. Try the 
following sequence on someone: “C will personnel by 20 applications room 
Of new for building be positions in the officer received.” The correct se- 
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quence is “Applications for new positions will be received by the personnel 
officer in room 20 of building C.” Consequently, the learning and emission 
of verbal behavior in relatively fixed and invariant orders is important to 
our use of language. 

There are other matters of great importance about verbal behavior, of 
course. Words carry meaning, and there are a variety of techniques for 
evaluating the role of meaning in verbal behavior, some of which we shall 
deal with later. Furthermore, it is clear that verbal behavior can be con- 
trolled and shaped by reinforcement in much the same way as the behavior 
of the rat in the Skinner box. Greenspoon (1955), for example, was able 
to control the rate of emission of Plural nouns in an interviewlike situation 
by making plural nouns contingent upon some simple indicator of satisfac- 
tion on the part of the interviewer (saying “um-hum”). Other studies show 
that verbal behavior can be controlled by stimulus conditions and rein- 
forcement schedules much as can any other instrumental activity (Buss, 


Weiner, and Buss, 1954; Verplanck, 1956). While this is important, the 
mere fact that verbal behavior is like any other cl 


havior shows that this kind of control is not importa 
ties of verbal responses their chaining and their m 
turn directly to the serial Characteristic of verbal lea 


ass of instrumental be- 
nt to the unique proper- 
eaning. Let us therefore 
ning. 
Methods in Rote Verbal Learning 

In rote verbal learning we consider the 
upon learning, performance, and retention. Ordinarily, when we are faced 
with the task of learning some new verbal 
poem, we sit down with the material in f 
Procedure, learn it. We seldom memorize 
ning and going through it line by line. Mo 
groups or skip from one part to the next. 

This hit-and-miss Procedure will not do if One is interested in studying 
response organization, or chaining, in verbal learning. In laboratory studies 
of human learning, we are interested in the relationships between the parts of 
the material to be learned and how these relationships determine features 
of learning. In order to Study these, it is necessary to hold constant the 
order in which the material is presented to the Subject. Thus, in laboratory 
learning, the method of learning is Partially controlled by presenting to the 
subject only one unit of material at a time in some predetermined order. 


influence of chaining of responses 
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Since the technique permits a control of serial relationships in the material 
to be learned, it also makes possible a systematic study of the influence 
these relationships have on learning. The serial-interaction effects are funda- 
mental to verbal behavior, and it is essential that laboratory studies of 
human learning be well controlled in this respect. 

Types of verbal material. As we shall see later, one of the most im- 
Portant determiners of rate of verbal learning is the type of verbal material 
learned, and it is important that this be carefully specified in every experi- 
ment. Furthermore, the verbal items in a given chain differ enormously 
from one another in important characteristics. Look, for example, at any 
of the sentences on this page. Such words as “in,” “a,” “the,” etc., are obvi- 
ously different from such words as “control,” “material,” “methods,” etc. 
Consequently, seldom have investigators of verbal learning used ordinary 
Sentences in natural languages as experimental material. Usually psycholo- 
gists have made up their own verbal material so that some of the uncon- 
trolled factors in ordinary sentences are eliminated or held constant. 

One of the first investigators of human verbal learning, Ebbinghaus 
(1885), devised the most widely used technique for controlling the kind of 
Verbal material used in such experiments. He showed that we can use the 
Previously acquired verbal skills of human beings, and at the same time 
eliminate previous learning for the specific verbal material we wish to have 
the subjects learn. This can be accomplished with nonsense syllables. These 
are syllables novel enough to ensure that a uniform minimum of the sub- 
Ject’s verbal skills is called upon. Indeed, if our syllables are truly “non- 
Sense,” the subject cannot bring into the experiment a large assortment 
Of acquired reactions to the specific items he is to learn. 

The nonsense syllable is usually constructed of three letters, most fre- 
quently two consonants with a vowel between them. Thus SYL or NEP is a 
Nonsense syllable. Some of these three-letter nonsense syllables resemble 
English words more than others (for example, compare XyQ and CET). 
For this reason, several investigators have “calibrated” nonsense syllables 
in terms of their association value (Glaze, 1928; Hull, 1933). Such scaling 
has resulted in classifications of nonsense syllables according to the degree 
of meaning or familiarity. Incidentally, the association value, in terms of 
the number of associates given to nonsense syllables and words, has re- 
cently been itself used as a variable in verbal learning (Noble, 1952b; 
Mandler, 1955). 

Other kinds of material, such as lists of adjectives, etc., have been cali- 
brated in many ways. Melton (cited in Hilgard, 1951), for example, has 
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scaled lists of adjectives according to a measure of their degree of re- 
semblance. This kind of calibration is important in the study of generaliza- 
tion or similarity in verbal learning. Various statistical tables (Thorndike 
and Lorge, 1944) giving the relative frequency of occurrence of words in 
ordinary English prose are an important source of controlled verbal material 
for the study of learning. The verbal material an investigator wishes to use 
in a particular study will depend upon the purposes of his study. There is 
now available a wide variety of material, and as new needs arise, new con- 
trolled material will be developed. 


The serial method. In rote verbal learning the method of presenting 


Figure 31. A memory drum. The s 
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ice. (Courtesy of Ralph Gerbrands.) 


the material is usually controlled in order to force certain kinds of rela- 
tionships between the items in the material. In the serial method the ma- 
terial is forced into a rigid sequence. A memory drum (see Figure 31) is 
usually employed to achieve this Order. The items to be learned (nonsense 
syllables, for example) are exposed to the Subject one at a time in the 
small window in the memory drum. These items are usu 
constant rate, so that the same amount of time is spent 0 
Subject is instructed to anticipate each syllable before it al 
dow. There is sometimes a blank Space between each syll 
the subject time to anticipate. Usually the subject is 
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pronunciation difficulties. 
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In the serial method, each item serves in turn as a stimulus and as a 
response. For example, the first item in a list might be the nonsense syllable 
SYJ. When the list begins on each trial, the stimulus for this syllable is pro- 
vided by the symbol used to indicate the beginning of the list. The next 
syllable might be Loz. syJ is the stimulus which tells the subject that Loz is 
coming up. Loz in turn might be followed by NEP. Thus, syJ is the stimulus 
for the response “L-0-Z.” LOZ is the stimulus in turn for the response 
“N-E-P.” 

Table 1 shows the results of an experiment on rote verbal learning with 
one subject. In the columns are given the 12 syllables in the list. The rows 
represent the trials, and there were 20 given to this subject. Note that a trial 
is defined as one repetition of the entire list. The plus signs represent correct 
anticipations, and the zeros represent incorrect responses or no responses 
at all. In some experiments, incidentally, we may be interested in recording 
the exact errors made by the subject. The first time through the list, of 
Course, the subject can get no responses right, since this is the first time he 
has seen the material. Notice that this particular subject guessed at the first 
syllable on the second trial and got it right. On the third trial he correctly 
anticipated the first syllable, the second syllable, and the last syllable. On 
the nineteenth and twentieth trials the subject anticipated all the syllables 
Correctly. 

If we are interested in studying the retention of verbal material, some 
time after the original learning we may give a test of retention. Such a test 
could be obtained in exactly the same way the learning data were obtained; 
the subject is required to anticipate each syllable in turn. On other occa- 
Sions, we may be interested in a free-recall test; the subject is simply asked 
to recall as many of the syllables as possible in their correct order. 

The paired-associates method. For many purposes there is one grave 
defect in the serial method—each item serves as a stimulus and a response 
in turn. Sometimes it is important to sort out and independently vary the 
Stimulus and response aspects of a task to be learned. This cannot be done 
With the serial method; consequently, a modification known as the method 


Of paired associates has been developed. 

In this method, as its name implies, 
learner. He is instructed to learn the pairs in such a way that when the first 
member of the pair appears, he is able to recall the second. Thus the 
first member of a pair is a stimulus item and the second member a response 
item. Unfortunately, in this method, the serial order of presentation cannot 
be retained, since if it were, subjects could make a connection between re- 
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sponse items and the stimulus items which follow. Therefore, in this method, 
the pairs of stimulus and response items are presented in some random or 
prearranged order which is different for every trial. Obviously, the method 
of paired associates destroys the simple serial characteristics of rote learn- 
ing, but it enables the investigator to distinguish between stimulus material 
and response material. For some problems the simple anticipation method is 
more suitable; for others the method of paired associates provides the most 
useful technique. The uses of this method will be described in later chapters. 


Associations in Rote Verbal Learning 

One of the fundamental problems is the degree to which different items 
ina verbal-learning task are associated with one another. In the method of 
serial learning we know that successive items are associated, since otherwise 
the learner would not be able to anticipate. It turns out, however, that 
learners also make connections between items that do not adjoin. These are 
called remote associations. 


The nature of remote associations. 
syllable A becomes a cue or stimulus for the response of nonsense syllable 


B. B is a cue for the response C, etc. The subject learns the list in order. 
In addition, however, syllable A is connected to some degree with syllable 
C, so that A will occasionally give rise to the response C. Such responses, 
Of course, are analogous to anticipatory errors found in the studies of the 
linear maze. In verbal learning these anticipatory responses are called 


In a typical experiment, nonsense 


emote forward associations. A 
Since perseverative errors are found in the linear maze, it might be sup- 


Posed that the same errors would be found in rote verbal learning as well. 
Indeed, they are. If the learner responds with A when he should have re- 
Sponded with D, a perseverative error has occurred. In verbal learning, 
Such errors are called remote backward associations. 


Ebbinghaus (1885) was the first to detect these remote associations 


through a technique known as the method of derived lists. First he learned a 


list of nonsense syllables—Ebbinghaus always used himself alone as a sub- 
ject—then learned a second list obtained by systematically rearranging the 


first list. Sometimes in the rearrangement every other item was made ad- 


Jacent, Sometimes every third item, and sometimes every fourth. Ebbing- 


haus’s results showed that he could relearn systematically rearranged lists 
much more quickly than he could relearn ones haphazardly rearranged or 
an entirely new list. Furthermore the fewer the number of items skipped in 
the rearranged lists, the quicker the time of learning. From this Ebbinghaus 
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inferred that remote (forward) associations had helped him in the learning 
of the rearranged lists, and therefore that these associations must have been 
learned in the original lists. 

Ebbinghaus’s results tell us little about the specific remote associations 
formed. Nor finally, does his method indicate the frequency of remote asso- 
ciations. 

A method used by McGeoch (1936) tells us a 00d deal more about the 
exact nature of remote associations. He asked subjects to practice, by the 
method of anticipation, memorizing a series of nonsense syllables. He 
stopped the practice before the Subjects had learned to anticipate perfectly 
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there is an association between a trace of the earlier item and the occurrence 
of the later one. This theory has been greatly elaborated by Hull and his 
associates (Hull, et al., 1940) and used to explain many characteristic phe- 
nomena of rote serial learning. 

Another theory interprets remote associations as mediated connections 
between two items in a list and a third item common to them. The medi- 
ating factor might be something within the subject himself or some constant 
feature of the environment (the context) to which the specific items on 
the list become connected (McGeoch, 1942). There is experimental evi- 
dence that mediated associations can produce learning. Suppose, in a 
Paired-associate experiment, we ask subjects to associate nonsense syllables 
to common words, so that given the nonsense syllable, the subject can recall 
the appropriate word. For example, given CEF, the subject would learn to 
Say “stem.” Now “stem” is a strong eliciting stimulus, in a free-association 
test, for the word “flower.” And “flower,” in turn, is a strong eliciting stimu- 
lus for “smell,” but “stem” does not tend to elicit “smell” directly. If we 
Tow take a series of such chains and ask the subjects to learn the third mem- 
ber of the chain (smell) to the original nonsense syllables, apparently they 
learn more rapidly than they would learn a list of unrelated words (Russell 
and Storms, 1955). In other words, the associative habits built into the 
Subjects before the experiment result in the mediation, or transfer, of the 
Pairing of the same nonsense syllables from one list of words to another. 
This, of course, tells us nothing about how such mediational activity comes 
about, but it does suggest that associations between two items arise from 
the mediation of chains of intervening associations. 

Remote forward and backward associations in rote verbal learning may 
have more than a superficial similarity to anticipatory and perseverative 
errors in linear maze learning. Anticipatory and perseverative errors are at- 
tributable to stimulus generalization. It is difficult, however, to see how 
Simple gradients of stimulus generalization in verbal learning could be based 
exclusively upon resemblances among the items themselves. Such similarities 
Would not always produce gradients of remote associations, since the re- 
Semblances might be distributed in a chance manner throughout a list. It is 
Possible, however, that the context in which a particular item is to be emitted 
allows for stimulus generalization (see Schulz, 1955). If, for example, in a 
list of nonsense syllables Dox is followed by CET, DOX and the context pro- 
Vided by having gotten through so many items in the list elicit CET. Suppose, 
however, that the context generalizes to some later point in the list. Dox might 
then elicit some other response, say ZYM. In intuitive terms this would happen 
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when the subject had “lost his place” in the list. It is even possible that such 
contextual generalization might be aided by random resemblances among the 
syllables themselves, as perhaps would be the case if DoQ preceded ZyM 
in the list. 

If the notion that contextual and item generalizations are responsible for 
remote forward and backward associations is correct, then the number of 
remote associations ought to increase directly as the similarity between 
items in a list is increased. Although there is no direct evidence that this is 
50, there is some indirect evidence. Underwood and Richardson (1956) 
find, for example, that rate of learning is a function of the similarity of 
items within a list. The higher the similarity, the lower the rate of learning. 
This means that the greater the generalization between items on a list the 
slower the learning, and, by implication, this lower rate of learning is the 


result of a greater number of remote associations intruding at the wrong 
point in the list. 


The Serial-position Effect 


For a long time students of learning have been aware that there must be 
an intimate connection between remote associations and other characteristic 
Phenomena of rote verbal learning. One of these is the serial-pos 
This effect refers to the fact that the rate at which a given ite 
list of items is learned depends upon the location of that ite! 
list. When learning is by the serial method, 
sense syllables is learned most quickly and the end of the list is next easiest 


to learn. For this reason, the serial-position effect h 
the primacy-finality 


ition effect. 
m in a serial 
m within the 
the beginning of a list of non- 


as sometimes been called 
effect, to emphasize this point. 


Serial position in rote serial learning. We can see what the serial-posi- 
n effect looks like by examining the curve in Figure 33. This curve is 
taken from an experiment on the memorization of nonsense syllables by the 
serial method (Hovland, 1938b). The total number of errors or failures of 
anticipation at each position in the list is shown on this curve. The greatest 


number of errors occurred at the seventh syllable position, which is just 
past the middle of the list. 


The shape of this curve has wide 
list of serial items le. 
As more items 


tio. 


Senerality. The most difficult part of a 
arned by anticipation is Just past the middle of the list. 
are added, the point of maximum errors always remains in 
the same relative position (Hovland, 1940). The serial-position curve is 
essentially the same for meaningful words as it is for nonsense syllables, 
though, of course, the total number of errors is less in the former case (Mc- 
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Crary and Hunter, 1953). If one compares early performance with late 
performance in learning, of course, the serial-position curve is flatter in the 
later case, but this is simply because all of the items are finally learned 
(Ward, 1937). 

Explanation of the serial-position effect. The best-known attempt to 
explain the serial-position effect for anticipatory rote learning is that of 
Hull (1935). Hull argued that remote forward associations inhibited im- 
mediate associations between them. For example, a remote association be- 
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tween item ld inhibit or 
S 1 a would inhi চি y a 
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intervening direct associations to Occur. Consequently, 
dle of the list should be inhibited in proportion to the number of delayed 
Associations which span them. This notion is illustrated In Figure 34 (Hull, 
1935). Notice that the greatest number of delayed Associations bridges the 
middle items, and this is a way of representing the notion that the most 
difficult items to learn should be in the middle. Cl 
There are two things wrong with Hull's hypothesis. First of all, it ignores 
this is important because remote forward 


Temote backward associations; চে as 
mmetrical. Secondly, it ig- 


and backward associations are not perfectly sy 
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SERIAL POSITION 


Figure 34. Illustrates how remote forward associations could account for a symmetrical serial- 


Position curve. The number of errors or omissions at each position is Proportional to the number of 
associations spanning that portion. (Hull, 1935.) 


nores the fact that the middle item is not the most difficult to learn in actual 
practice; the items just past the middle are. Hull and his associates (Hull, 
et al., 1940) tried to remedy this difficulty in a general theory of rote learn- 
ing, but the results were so terribly complicated that they have been re- 
garded as rather unconvincing. 


Bugelski (1950) pointed out that remote associations could much more 


convincingly account for the serial-position effect if we simply assume that 


they interfere with the emission of correct responses. This would place the 
greatest interference at slightly past the center of the list. Thus, while Bugel- 
Ski’s analysis produces some skewing in the theoretical serial. 
it does not produce as much as actually exists in practice. 


If interference from remote associations is the 
ing the serial 


-position curve, 


primary factor in produc- 
Position effect, it ought to be possible to test this directly by 


counting the intrusions of wrong responses at each position. Unfortunately, 
the usual method of presenting material in 


does not permit this to be done. The s 
fast rate, so most of the “ 
intrusions. This may be 


serial-anticipation experiments 
yllables are paced at a moderately 
errors” are simply failures of response, rather than 
because the subject takes too long in getting the er- 
roneous response out, and by the time he is ready to do so, the next syllable 


has appeared. Deese and Kresse (1952) performed an experiment in which 
they allowed unlimited time for the subject to respond between syllables. 
When this is done, it appears that the remote associations (intrusions of 
other items from the list) are almost Symmetrically distributed about the 
middle of the list. The displacement of maximum difficulty from the middle 
to just past the middle is explained by the fact that, as the list proceeds, the 
subjects are simply unable to respond with even a reasonable guess. Thus it 
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appears as though the serial-position effect is composed of two components, 
an almost symmetrical component of overt intrusions or remote associations, 
and an increasing tendency through the list to be unable even to guess. 

Though the tendency to make intrusive errors based upon remote associa- 
tions accounts for the bowed shape of the serial-position curve for rote 
serial learning, it does not account for the skewing, or displacement, of the 
curve. This displacement is probably the result of a combination of several 
factors including memory span and perhaps something like the reactive in- 
hibition factor discussed in the chapter on extinction. 

The serial-position curve in free recall. Suppose that a list of nonsense 
syllables or randomly arranged words is presented to a subject, and, instead 
Of recall by serial anticipation, the subject is allowed to recall the words in 
any order he chooses. Under these conditions the most frequently recalled 
Words are at the end of the list (Welch and Burnett, 1924; Deese and Kauf- 
man, 1957). Two sample serial-position curves for such free recall are pre- 
sented in Figure 35. (Note that these curves are plotted in correct responses 
rather than errors.) Thus it appears that the serial-position curves for serial 
anticipation and for free recall are roughly the reverse of one another. In 
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one, the greatest number of correctly recalled items is at the beginning of the 
list, and in the other, the greatest number is at the end. 

It turns out that in free recall the greatest determinant of the probability 
of recall (as well as the order of items in recall) is the associative strength 
of the item (Bousfield, Cohen, and Silva, 1956). Thus in the immediate free 
recall of randomly arranged items it is probable that the iast few items in 
the list are strongest, perhaps because of the persistence of an image of 
them; therefore, these items are recalled first and have the 


greatest probabil- 
ity of being recalled. 


Something quite different seems to happen with the free recall of con- 
nected prose, however. If brief Paragraphs of English 
material and recall is unrestricted, the resulting serial-position curve looks 
more like that of the method of anticipation rather than that of the free 


recall of random items (Deese and Kaufman, 1957). This is, of course, 
because the recall of connected material is not re 


teristics of ordinary prose force the Subject to st 
£0 through to the end, 


(see below) of langua 
ganized material is m 
terial. A systematic e 


are used as learning 


ally free; the serial charac- 


recall by serial anticipation (Deese and 
ously suggests that the method of serial 


€ recall processes used for ordinary lin- 
Ofore supposed. 


Kaufman, 1957). This result seri 
anticipation is much more like th 
guistic material than we had heret 


Statistical Organization of Verbal Material 

In the last few Paragraphs we have discussed 
tion in verbal material without reall 
tion.” In this section we shall discus 
a particular kind — statistical Organi. 
tial learning of verbal material. Sta 
count of dependent probabilities b 
that dependent probabilities have b. 


the question of organiza- 
Y saying what we mean by “organiza- 
S the meaning of verbal organization of 
Zation—and how it affects the sequen- 
tistical organization involves taking ac- 
etween items in exactly the same way 
een applied to multiple-choice behavior- 


explored this Problem, we are already 
familiar with the general concepts. 


A moment’s thought will make it clear th 


at dependent probabilities apply 
to the elements of languages. T. 


ake the occurrence of letters in written Eng" 
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lish, for example. Everybody knows from Poe’s The Gold Bug that letters 
in English do not occur with equal frequency. The letter e appears more 
frequently than any other letter; consequently, if we were to guess that the 
letter e occurs in a given sequence of letters rather than simply picking a 
letter at random, we should have a better chance of being right. Likewise, the 
Occurrence of a given letter is influenced by what has gone before. Therefore 
We can use our knowledge of what has preceded in a particular passage to 
predict what is going to happen next. In English, the letter q is always fol- 
lowed by u; thus the letter immediately after q is perfectly predictable. It is, 
to use the term frequently used in such analyses, an example of perfect re- 
dundancy— the u tells us nothing new, and if it weren't for habitual usage, 
might just as well be left out. There are associations between all other letter 
combinations in English, though not quite so strong as that between q and u. 
The letter 1, for example, is more likely to be followed by A than by m. 

It is easy to see, furthermore, that the probability of occurrence of a 
Liven letter will be influenced by the letter which precedes it by two or three 
Spaces and more. In English, for example, two consonants are more likely 
to be followed by a vowel. The probability of occurrence of a given letter in 
English, as a matter of fact, is influenced to a significant degree by letters as 
Many as seven positions in front of it (Newman and Gerstman, 1952). 

The method of actually calculating the probabilities involved in such or- 
Sanized sequences of events in language derives from the mathematical 
theory of communication or information. Information theory, as it is usually 
called, has had important methodological implications for the study of 
human language and learning. We shall not deal with the theory directly in 
this book, although it will indirectly enter into many of the problems of lan- 
Suage learning that we shall face. A general introduction to information 
theory is given in Miller's Language and Communication (1951). For now, 
however, let us simply accept the fact of dependent probabilities and the 
Tesulting statistical organization in language, and examine some of its impli- 
Cations in the serial learning of linguistic material. 

Statistical organization and immediate jree recall. A textbook on the 
Psychology of learning is hardly needed to tell one that nonsense material 
1s harder to learn and recall than connected, logical material. Some experi- 
mental studies show us, however, that it is not necessarily the sensibleness 
Of connected material that is primarily responsible for this. 

Miller and Selfridge (1950) show how gradual approximation to the 
Statistical structure of ordinary language influences the amount of material 
recalled. An order of approximation to sensible English may be defined by 
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the number of preceding items which determine the present item, and in 
practice it serves as a way of constructing verbal material with various 
levels of dependent probabilities without going to the enormous labor of 
computing the necessary probabilities involved. If we select words at ran- 
dom from the dictionary or a word list, we should have a zero-order approxi- 
mation to English. An example of a zero-order sequence might be: “pack 
byway phosphor kettle hypermetrical gloat.” A first-order approximation 
would select words according to their relative frequencies in English usage. 
Thus common words like “the” and “before” would most likely show up. 
If we gave an individual a word to begin a sentence with and gave the sec- 
ond word of his sentence to another individual for him to start a sentence 
with, etc., we should be able to construct a second-order approximation 
from the resulting list of words. An example of a second-order approxima- 
tion taken from Miller and Selfridge’s paper looks like this: “was he went 
to the newspaper is in deep and.” For higher-order approximations, indi- 
viduals may be given sequences of words to use. Jn these cases, they add a 
word directly after the sequence and drop the first word of the sequence. 
This new sequence is then given to another individual, etc. The words im- 
mediately following the sequence of given words provide us with a higher- 
order list. An example of a seventh-order approximation obtained in this 
manner looks like this: “then go ahead and do it if possible while I make 
an appointment I want to skip very much around the tree and back home 
again to eat dinner after the movie early so that we could get lunch because 
we liked her method for sewing blouses and shirts is.” 

We can see that the seventh-order approximation appears to make vague 
sense—a sort of stream-of-consciousness sense. This is because it preserves 
the ordinary dependencies of speech without having anything in particular 
to say. Miller and Selfridge showed that it is this chain that determines how 
much material a person remembers. Up to a point, the higher the order of 
dependency, the easier it is to remember. With adult subjects, Miller and 
Selfridge found little or no difference in the recall scores for a seventh-order 
approximation and good, sensible English. Thus, even though the seventh- 
order approximation is essentially nonsense, it is recalled as well as the 
sensible material in a test of immediate free recall. 

A summary of Miller and Selfridge’s results is given in Figure 36. This 
shows the relationship between order of approximation and per cent of 
words correctly recalled. Notice that above fifth-order approximations there 
seems to be little improvement in recall. This apparently is because this 
and higher approximations preserve the short-range dependencies which 
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the individual can remember. Thus it appears that to some extent meaning- 
ful material is more easily learned because it preserves the short-range 
associations of ordinary language. 

As we mentioned earlier, introducing statistical organization also changes 
the form of the serial-position curve. Deese and Kaufman (1957) studied 
the serial-position curves that resulted from the immediate free recall of 
lists of words with different orders of approximation. Introducing the or- 
ganization into the material presented for free recall gradually changes the 
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Figure 36. Per cent of words correctly recalled as a function of the order of approximation to the 


statistical structure of English. (Miller and Selfridge, 1950.) 


Characteristics of the serial-position curve from those of recall of unorgan- 
ized material to those of serial anticipation. This seems to be additional evi- 
dence that the statistical organization preserves the short-range anticipa- 
tory associations in ordinary language. It should be emphasized that statis- 
tical dependencies in language do not account for all the greater ease of 
Tecall for meaningful material; some of it must be due to the context derived 
from the content of the material (Marks and Jack, 1952). 

Recall and organization in nonsense syllables. In another approach to 
much the same kind of problem (Aborn and Rubenstein, 1952; Rubenstein 
and Aborn, 1954) a group of subjects was made familiar with a set of 
Nonsense syllables. These subjects then learned combinations of these non- 
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sense syllables with various degrees of patterning introduced among the 
syllables. Under one condition they learned random arrangements of the 
syllables, under other conditions they learned that there were to be patterns 
among the syllables and were instructed in the actual nature of these patterns. 
After learning the nature of the patterns, they memorized the syllables them- 
selves. It was clear that learning the organizational rules beforehand enabled 
these subjects to remember more individual syllables. However, interestingly 
enough, they did not profit from the Patterns as much as they could have. In 
terms of amount of information (which takes into account the existence of 
the patterns) they actually remembered less with highly organized than with 
random patterns. While it is possible that this is merely because they did not 
have enough practice with the Patterns, it suggests th 


learning, do not take as much advantage of statisti 
should. 
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CHAPTER 8 


THE LEARNING OF SKILLS 
AND CONDITIONS OF PRACTICE 


Most of us have at our command an enormous range of skills we have 
Patiently acquired over the years. Some of these skills are highly specialized 
and some are fairly universal within a given culture. Most American adults, 
for example, seem to be able to drive an automobile, while only a few of 
Us can operate a machine lathe. The psychologist’s interests in these skills 
Are many and varied. Some psychological problems arise, for example, 
When we ask about the pattern of native abilities that goes into any given 
Skill. Other psychological problems come up when we ask about the inter- 
actions of these skills—how an individual may be hindered or helped in 
learning a new skill by previously acquired skills. In this chapter we shall 
be concerned primarily with the ways of experimentally analyzing the learn- 
Ing of complex skills and how such learning is influenced by the methods 
and conditions of practice. We shall largely ignore the question of abilities 
and defer to later chapters the question of interaction among separate skills. 


LEARNING CURVES AND SEQUENCES OF SKILLS 


The first thing to occur to anyone given the task of finding out how 
human beings acquire skills would be to chart the results of practice at the 
task in question. Psychologists spent a great deal of time doing this in the 
early experimental studies of the learning of human skills. Often as much as 
a third of an early textbook on the psychology of learning was devoted to 
the measurement of improvement with practice and plotting this improve- 
Ment in learning curves. 


There is sometimes much to be gained by this practice. Frequently, in 
181 
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remedial-reading courses or in special-training courses in reading, the in- 
Structor recommends that students keep records of their daily reading rates 
and plot these on a chart. There seems Sood reason to suppose that this is 
a useful teaching device for some skills. Sometimes learning curves are used 
in industry for monetary incentive scales and for evaluation of the success 
of particular industrial training programs. 

In addition to these more or less practical uses, however, it has some- 
times been thought that learning curves would tell us something funda- 
mental about the learning Process itself —indeed, perhaps that there was a 
generalized law about the learning of skills. Unfortunately, such hopes 
about learning curves have never quite been realized. However, there are 
general characteristics of some learning curves that are important. 

A theoretical derivation of a learning curve. Thurstone (1930a) pre- 
Sented a theoretical derivation of a learning curve based on the assump- 
tion that all acts could be classified into successful components of skills 
and errors which will not lead to successful completion of the act. From 
this and simple assumptions about rate of error elimination Thurstone de- 
rived an equation that describes a learning curve. This curve can represent 
the acquisition of a remarkably wide class of skills. For example, at least 
one aspect of the learning of typewriting can be described by this equation, 
since the results of practice at typing can easily be classified into successes 
and errors. The derivation, however, would yield no information about a 
curve describing the effects of Practice on speed of typing. 

One interesting thing about Thurstone’s derivation is that it contains 
some Psychologically meaningful numerical constants. There is one, for ex- 
ample, which concerns the rate of error elimination. If the constant has the 
Value of zero, no errors will be eliminated because the learner will not pr ofit 
by the performance of an erroneous response, and its probability of reoccur- 
rence will remain unchanged. If, however, this constant has the value of 1.0, 
each time an error is performed its Probability of repetition is reduced to 0. 
Thus, what is commonly called trial-and-error behavior can be described as 
happening whenever this error constant is between 0 and 1.0. If this con- 
stant becomes greater than 1.0, however, it means that the occurrence of 
an error eliminates not only this particular erroneous response, but the 
learner profits enough by this mistake to be able to eliminate possible fu- 
ture occurrences of other errors. This, Says Thurstone, is the essence of 
rational learning. If a learner can profit by error to the extent of generaliz- 
ing to other possible errors, it certainly is more than trial-and-error learning. 

The possibilities inherent in Thurstone’s analysis of learning curves have 
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never been fully realized. We should note, however, that Thurstone’s anal- 
Ysis does not cover all learning curves; it is meant only for those in which 
the components of performance can be plotted in terms of proportion of 
Correct or erroneous responses. Even here the agreement with data is not 
As good as with other theories. 

Types of learning curves. There are many ways in which components. 
of skills can be measured. For skills in which the components can be classi- 
fied into errors and correct responses, curves can be plotted which show the 
Way in which errors decrease with practice (trials) or the way in which 
Correct responses increase. In addition we can plot curves which show the 
length of time it takes to complete a unit of practice at the task. This is the 
Kind of curve one would obtain if the length of time it took an individual 
to assemble a jigsaw puzzle were measured. Obviously, time per trial de- 
creases with practice, or we do not have evidence for learning. If, for exam- 
Ple, the same individual assembled the same jigsaw puzzle several times in 
Succession and there was no decrease in time of assembly, we should con- 
clude that the individual had not shown evidence of learning to assemble 
the puzzle. 

In many tasks we can obtain measures based upon both errors and time. 
In maze learning, for example, we can plot both the number of blind alleys 
4 subject enters on a given trial and the time that it takes him to £0 through 
the maze. 

In addition, in some tasks, we may be interested in plotting the way in 
Which accuracy changes with an increase in the number of trials. If we were 
Studying the ability of an individual to learn to fire at a stationary target, 
for example, we might plot a learning curve in terms of his deviation from 
the center of the target for each trial. Or, if an individual were tracking a 
Constantly moving target, we might plot the amount of tine he stayed on 
the target for each trial. 

It should be noted that the measures of improvement are not all neces- 
Sarily independent of one another. In maze learning, for example, number 
Of errors and time per trial will be highly correlated. This is simply because 
the fewer the entrances into blind alleys, the less time it takes to go through 
the maze. There are fewer mechanical correlations as well. In general, for 
eXample, an increase in accuracy will go along with a decrease in the 
Amount of time necessary to accomplish a unit of work. In other words, 
improvement in one aspect of a task is usually accompanied by improve- 
Ment in other aspects as well. This is not necessarily so, but it is generally 
true of learning outside of the laboratory. 
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In addition there are many other changes in behavior during learning which 
often do not get charted. Usually there is a change in attitude of the indi- 
vidual toward the task. A task may be regarded as difficult at the outset of 
learning and as easy after much practice. There may be a change in the 
degree of tension which an individual shows during learning. In the begin- 
ning there may be much tension and anxiety which may be reduced by the 
learning of a task. Finally the mode of attack of the learner will most surely 
vary throughout the course of learning. Thus it is clear that there are many 
aspects of the change in performance with practice, and some of these are 
often ignored. 

Many times learning curves are obtained by averaging together the per- 
formance of a number of individuals at each trial. The learning curve then 
is a kind of average learning curve. “Kind of average” is sometimes 
apt, for there are frequently serious distortions introduced by this proce- 
dure. This has been pointed out many times, most recently by Estes (1956). 
Most generally this is true because the form of the curve for averaged 
measurements is not necessarily the same as that for the individual subjects. 
For example, suppose we plotted learning curves for a group of individuals 
who always learned instantaneously (as would be the case if the learners 
had “insight” ). Thus on some one trial an individual’s score would go from 
zero to the maximum possible score. If some of these individuals learned at 
different trials, the result could be a smooth, gradual learning curve for the 
average of all these individuals. If we looked only at the average curve we 
should infer that the learning of this task proceeded in a gradual way, with, 
perhaps, more learning in the early trials than in the later trials.* Thus, be- 
cause learning curves for individuals are not always the same as those for 
groups, we must be careful about the inferences we draw. 

Plateaus. If we look at learning curves for individuals an interesting 
phenomenon occasionally turns up. This is illustrated by the curve in Fig- 
ure 37. The task this individual was practicing was receiving telegraphic 
code. Notice that there was a period of about six weeks during which the 
subject did not appreciably increase his ability to receive code signals. 
After this period, there was a sudden spurt in letters received per minute 
which gradually leveled off to a final asymptote. Such a period of little or 
no improvement is known as a plateau. 

The particular curve in Figure 37 comes from a well-known older study 
by Bryan and Harter (1897, 1899), who were trying to determine the 
reason for this plateau, which they thought fairly typical of learning tasks 


1 This would be true if the distribution of occurrences of “insight” were logarithmic. 
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like receiving code. They put forward the notion that a plateau occurs when 
an individual has learned all of the primary skills in a complicated task 
but has not yet organized them into broader units so that he can learn the 
higher-order skills. They argued that learning any skill like telegraphy in- 
Volves a hierarchy of habits; that is to say, the learning of the higher-order 
Skills depends upon mastery of the lower-order skills. In learning to receive 
telegraphic code, for example, individuals first learn to receive individual 
letters. Then as they master the skill involved in this, they can take advan- 
tage of the redundancy, or predictability, in English and learn to receive 
by words instead of by letters. Finally, they can receive phrases or words 
as units. 
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Figure 37. Learning curve for receiving telegraphic cod 
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curve just before the minimum rate is reached. (After Bryan and Harter, 


Bryan and Harter argued that the plateau ought to occur at the transi- 
tion between learning to receive individual letters and learning to receive 
Words. They were able to justify this interpretation by a study of the way 
in which people learned to receive jumbled letters rather than real words. 
They found that the plateau for meaningful material occurred at just about 
the point at which there was no further improvement in the ability to receive 
disconnected letters and words. Thus, the plateau occurs where the limit 
On the lower-order habits has been reached and where the higher-order 
habits have not yet begun to appear. The results of this comparison of dis- 
Connected and connected material can be seen in Figure 38. 

It is obvious that the notion of hierarchy of habits is useful whether we 


Are interested in plateaus or not. Something like this hierarchical structure 
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must occur in all tasks in which there is opportunity to recode the material 
into larger and larger units. Thus, learning to typewrite, read music, and the 
basic skill of learning to read itself must involve something like hierarchies 
of habits. 

Plateaus do not always occur. Indeed, in telegraphy they seem to be the 
exception rather than the rule (Taylor, 1943). The transition between or- 
ders of skills is more often a smooth one. Also, there seem to be other causes 
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Figure 38. Learning curves for receiving connected discourse, disconnected words, and discon- 


nected letters. Notice that the Plateau in the curve for connected discourse occurs at the point where 


there is no further improvement in the curves for disconnected material. (After Bryan and Harter, 
7899.) 


of plateaus in the learning curve. Swift (1918) argued that plateaus are 
caused by a decline in the learner’s motivation. He pointed out that pla- 
teaus for all learners occur in about the same place; furthermore they occur 
in many tasks which do not involve a hierarchy of habits. Book (1925), in 
his handbook on learning to typewrite, located the lapses in attention and 
drop in motivation at the point where letter habits were making a transition 
into word habits. Thus the failure to improve generated discouragement, 
which furthered the failure to improve. Individuals may become so discour- 
aged at failure to improve that they lose motivation for further practice. 
Indeed, plateaus may well be a fact which keeps adult learners from ac- 
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quiring such skills as playing tennis or playing the piano. A well-motivated 
adult can learn the fundamentals of these skills quite easily, but there then 
follows a long period when devoted practice brings no fruits. At this point 
interest drops and the individual will cease taking lessons. 

Frequently, the limit of performance which learners finally reach is a 
Plateau. The author doubts that his typing has improved much in the past 
fifteen years, but there is certainly room for improvement. A little practice 
Would eliminate some bad habits and bring in some new levels of organiza- 
tion which would no doubt quickly raise his level of typing. Thus, while he 
is Currently at a limit of performance in typing, practice would raise that 
limit. There are surely real physiological limits to our ability to perform 
specific skills; we cannot type faster than our fingers will move. Most of us, 
however, seldom push ourselves to this limit, 


CONDITIONS OF PRACTICE 


The learning of any particular skill is a function of the conditions under 
Which it is practiced. The shape of the over-all learning curve may be al- 
tered by change in conditions, or the presence or absence of particular 
components of the skill may be altered. In this section we shall examine the 
influence of some of the more important conditions of practice which have 
been experimentally studied. We shall restrict ourselves to conditions that 
apply to many tasks, since we are interested in general principles of learning 
rather than the characteristics of certain tasks. To take an example: We 
might be interested in the influence of the location of individual letters on 
the typewriter keyboard, since this could tell us something about general 
Problems of habit interference, but we should not be interested in the influ- 
ence, say, of depth of keystroke on learning to typewrite. The latter prob- 
lem may well be of interest to those concerned with the specific skill of 
typewriting, but it is of little general interest in the psychology of learning. 


The Distribution of Practice 


The condition of practice that has been explored experimentally more 
extensively than any other is that of the distribution of practice through 
time. For a wide variety of tasks, the question has been asked, Is it better 
to practice a task with as little interruption for rest as possible, or is rest 
beneficial to learning and performance? For most of the experimental tasks 
that have been studied, the results show that some rest is better than none 
at all, and for some tasks the advantage in increased skill is quite large. 
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The problem is complicated; there are many tasks for which distribution 
of practice has little effect, and there are a few for which rest periods actu- 
ally seem to decrease ability. Because of the complexity of the problem and 
because of the theoretical interest in the distribution of practice, it will be 
necessary for us to examine this condition in considerable detail. Let us 
first look at the experimental findings and then at some of the theoretical 
questions. 


Experiments on the Distribution of Practice 


A simple experiment. A well-known experiment by Lorge (1930) illus- 
trates the basic phenomenon in the distribution of practice. Lorge studied 
the effects of continuous practice for 20 trials compared with the effects of 
Practice in which a rest period of one minute or a rest period of one day 
intervened between each trial. The tasks he used were mirror drawing, 
mirror reading, and code substitution. He found that both cases of distrib- 
uted practice resulted in better learning than massed practice. Figure 39 
shows the results for the mirror-drawing task. We can see that the time per 
trial for the two groups with distributed practice dropped much more rap- 
idly than the curve for the massed-practice group; the difference is apparent 
almost from the very beginning of practice. Notice also that the difference 
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Figure 39. The effect of distribution of practice upon mirror drawing. Notice that, apart from 
the very first trials, there is almost a constant difference between the different groups. (After Lorge: 
1930.) 


THE LEARNING OF SKILLS AND CONDITIONS OF PRACTICE 189 


between the group that rested for one minute between each trial and the 
group that rested for one day is very small. 

This experiment illustrates the basic effect of distribution in a perceptual- 
motor task of moderate difficulty. As is typically the case for such tasks, 
the difference produced by some distributed practice compared with none 
is large and consistent. 

The effects of distribution when practice is unequal. In the experiment 
Just cited the amount of practice for the various groups was held constant 
and the time over which practice occurred was varied. What happens, how- 
ever, when the time over which practice occurs is held constant and the 
Amount of practice varied? In this case, a group of subjects under massed 
Practice would receive more practice than one under distributed practice. 

Duncan (1951) studied the effect of distribution of unequal practice on 
the acquisition of the ability to keep a small stylus on a constantly moving 
target (rotary-pursuit test). He allowed some of his subjects to practice 
Continuously; the remainder of his subjects were periodically interrupted for 
A rest period. The rest periods actually occupied two-thirds of the practice 
Period, so that the group practicing continuously had three times as much 
Practice. Despite this heavy balance in favor of the massed-practice group, 
distributed practice produced better performance. The distributed practice 
Sroup maintained the superiority it had shown during the initial learning 
after a pause of 10 minutes. Therefore it is clear that the effects of distribu- 
tion are powerful and consistent. 

Length of work and rest periods. It is reasonable to suppose that the 
relative effectiveness of distributed practice would depend upon the abso- 
lute and relative lengths of the work and rest periods. There are a number 
Of experiments that explore this problem and provide us with important 
Clues about the underlying mechanism responsible for the distributed- 
Practice effect. 

Kientzle (1946) studied the effect of varying the rest period while hold- 
ing the work period constant. The measure of performance she used was 
the number of alphabet letters subjects could print upside down in a one 
Minute trial. She varied the rest intervals between trials from zero to seven 
days. Her results can be seen in Figure 40. In general, her experiment 
Showed that for this task small rest periods resulted in a great improvement 
OVer no rest between trials, but that beyond 45 seconds, not much advan- 
tage was gained by increasing the rest interval. 

In another experiment on the alphabet-printing task, Kimble (1949b) 
Showed with trials of 30 seconds duration that the advantage of spacing 
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trials kept on increasing from zero to 30 seconds rest. Since Kimble did not 
study distributions greater than 30 seconds, we do not know whether there 
would have been any advantage gained from even longer rest periods. 

In the two experiments just discussed, the duration of the work period 
was kept constant. What is the interaction between the length of the work 
period and the length of the rest period? In one experiment (Kimble and 
Bilodeau, 1949) the task was rapid manipulation of small pegs, and it was 
found that shortening the work period is much more important than length- 
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Figure 40. The effect of time between trials on performance of a learned act on a particular trial. 
Notice that for this task (printing upside down) rest intervals greater than 45 seconds do not appre 
ciably increase level of performance. (Data from Kientzle, 1946.) 


ening the rest period. Thus, the Optimal rest period for any given task prob- 
ably depends upon the duration of the work period during practice. From 
the available evidence, it is likely that short Work periods and short rest 
periods are most beneficial to practice, but the actual duration of the work 
period will clearly depend upon the nature of the task. It will not do, for 
example, to break practice into unnaturally short units for the task in ques- 
tion; interrupting someone solving a puzzle before he has reached solution 
may well have disrupting effects upon practice. 

The interaction of distribution and nature of the task. A large literature: 
only a fraction of which is cited here, Supports the contention that some dis- 
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tribution of practice is better than none for nearly all psychomotor skills. 
We do not know very much about how distribution of practice interacts 
with the characteristics of specific tasks, however. For theoretical reasons 
(see below) it has commonly been supposed that distribution ought to be 
more effective for tasks involving a large amount of physical work and 
effort expenditure than for those involving little work and effort (see Ellis, 
1953), but there has been little systematic exploration of task-distribution 
interactions for psychomotor skills. 

Underwood and his associates, however, have studied rather exhaustively 
the relationships between distribution of practice and various conditions pre- 
Vailing in rote verbal learning. The results are rather complicated. For ex- 
ample, one study (Underwood and Goad, 1951) showed that distributed 
Practice was Superior to massed practice for learning lists of adjectives 
by the method of serial anticipation if the similarity between adjectives was 
high. If the similarity between adjectives was low, massed practice was as 
effective as distributed practice. On the other hand, irrespective of the 
similarity of items, learning adjectives by the method of paired associates 
Was unaffected by distribution (Underwood, 1951). Again, to make the 
Picture more complicated, Oseas and Underwood (1952) find that dis- 
tributed practice benefits the learning and retention of concepts, when the 
method requires subjects to abstract concepts by being exposed to succes- 
Sive examples of the concept. Furthermore, comparisons of the results 
achieved by Underwood and his associates with those achieved by another 
study (Hovland, 1949) suggest that the level of skill or sophistication of 
Subjects in laboratory learning may interact with the distribution of prac- 
tice (Underwood, 19536). 

The net result of all the intensive work on the effects of distribution of 
Practice on a particular kind of learning has been to show that it is ex- 
tremely difficult to apply a simple theory to all of the known phenomena of 
the distribution of practice. This is important, because much of the interest 
of experimental psychologists in the distribution of practice is not so much 
in the effect itself as it is in the means the effect has provided for testing 
Certain theoretical notions about learning and skilled behavior. We shall 
Teturn to some of these problems when we consider theories of the effects 
Of distributed practice. 

The permanence of the effects of different distributions. We have seen 
that for the vast majority of psychomotor skills, distributed practice is su- 
Perior to massed practice. For many verbal tasks this is also true. It is to 
be noted that wherever the effects of distributed practice have been studied 
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in realistic training programs, distributed practice has proved to be superior 
to massed practice. For example (Crawford, et al., 1947), in one study of 
training fighter pilots to fire at towed targets, the results clearly showed that 
if a fixed amount of ammunition was distributed over many training mis- 
sions the percentage of target hits was as much as five times better than if 
all the ammunition was fired on one mission. 

An important question we have not yet answered concerns the perma- 
nence of the effects of distributed practice. If we teach two different groups 
of people a simple task, one group practicing with distribution and the other 
with massed practice, the chances are that we shall establish a difference 
in performance at the end of practice. If we wait a period of time, however, 
before measuring retention, will the difference between these two groups 
disappear or be maintained? 

One kind of experiment in which the permanent effects of distribution 
have been important is found in studies of reminiscence. Reminiscence is a 
somewhat curious name, only remotely related to the usual context of the 
Word, but it is commonly used in experimental research on learning, and 


we shall use it here. Reminiscence occurs when individ 


uals practice at some 
Skill, stop, a 


nd after an interval of time are tested for further performance. 
The characteristic defining reminiscence is that there is an actual improve- 
ment instead of a decrement in performance (as there would be if forgetting 
took place). 

The original studies of reminiscence occurred in rote verbal learning 
and led psychologists to believe that it was a fairly general phenomenon in 
such learning. More recent studies suggest that in rote verbal learning it is 
limited to the learning of material like nonsense syllables when practice is 
severely massed (Hovland and Kurtz, 1951). Under these conditions there 
appears to be a slight temporary increase in the ability to recall nonsense 
syllables after a little time has elapsed. 

If the phenomenon of reminiscence were limited to rote verbal learning 
of nonsense syllables it would not be of much significance, since it occurs 
in limited amounts and only under highly specialized conditions for even 
this kind of learning. It turns out, however, that considerable amounts of 
reminiscence occur in the learning of psychomotor Skills. 

The importance of reminiscence to the Problem of the distribution of 
practice is that it seems to be a function of the massing of practice. In the 


case of nonsense syllables, no reminiscence occurs with distributed practice, 


but some does occur with severely massed practice. The implication is 


clear; with the passage of time some of the difference induced by massed 
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and distributed practice disappears—the massed-practice group performs 
more like the distributed-practice group. Two important questions arise. 
Does this difference in reminiscence between massed and distributed prac- 
tice occur in psychomotor skills? If so, does the difference between massed 
and distributed practice entirely disappear after a time interval? 

The answer to the first question appears to be clear enough. In psycho- 
motor skills, the amount of recovery during a rest period seems to depend 
upon the amount of massed practice that has gone before. The second 
question poses a more complicated problem. Under some conditions the 
difference between massed and distributed practice does entirely disappear. 
For example, in an experiment by Kimble (1950) on a psychomotor skill, 
Subjects practiced at the task under either severe massed or distributed prac- 
tice. After a six-minute rest period, despite large initial differences in per- 
formance between the two groups, the group that practiced under massed 
conditions did as well as the one trained under distributed conditions. In 
other words, reminiscence from massed practice made recovery from the 
effects of massed practice very nearly complete. 

This case of complete or nearly complete recovery (some ambiguity 
exists because of warming up when returning to a task after rest) is by no 
Means the rule. It may, indeed, be the exception. For much the same kind 
Of task, Jahnke and Duncan (1956) discovered that a difference produced 
by massed or distributed practice remains after an interval as long as four 
weeks, although the difference between massed and distributed practice 
after so long an interval is smaller than it was originally. Reminiscence oc- 
curs, but not so that the difference between the massed and distributed 
Sroups is wiped out. 

In verbal learning, much the same sort of situation occurs. Sometimes 
Teminiscence shown: by a massed-practice group of subjects will wipe out 
Performance differences between massed and distributed practice and some- 
times it will not. Even if reminiscence as usually defined does not occur, 
it is possible that the difference between massed and distributed practice 
could disappear in a later recall. This could happen if both the massed and 
distributed group forgot enough items to obtain equally poor recall COTES: 
Again, on this issue, the results of different experiments are not in 
dgreement. Retention of verbal material learned by rote methods (either 
Serial-anticipation or paired-associate) may or may not be enhanced by 
distributed practice (see Underwood, 1953a). The reason for this lack of 
Uniformity in experimental findings is important, and we shall ln to itin 
a discussion of the theoretical issues in the distribution of practice. 
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Transfer of the effects of massed practice. Jt is clear that massed prac- 
tice has some detrimental effects on performance, and these effects may or 
may not be permanent in nature. They turn up more consistently and 
clearly in psychomotor skills than in rote verbal learning. One important 
question has concerned the physical locus of these effects, for if distribu- 
tion is more effective in behavior which clearly involves large scale mus- 
cular movements, the suggestion immediately comes to mind that this might 
be because of some fatiguelike inhibition in the peripheral response mecha- 
nisms. Consequently a series of studies have been performed with a view to 
discovering the locus of the effects of massed practice. These experiments 
have usually been on bilateral transfer. 

In bilateral transfer, a subject is taught to perform some simple act of 
coordination by practicing with one of his hands only. After he has reached 
a certain level of skill, he is tested for this act by the other hand. If the sub- 
ject is able to do better with the unpracticed hand than a completely un- 
practiced subject would do, we say that bilateral transfer has taken place. 

Let us look at an example of the bilateral transfer of reminiscence (Irion 
and Gustafson, 1952). In this experiment subjects practiced continuously 
for five minutes at following a moving target (pursuit rotor). After prac- 
tice, one group of subjects immediately received five minutes additional 
practice with the other hand, while subjects in a second group rested for five 
minutes before changing hands. It turned out that the group that rested 
Was significantly superior with the other hand to the group that transferred 
immediately. Thus the detrimental effects of massed practice transferred 
from one hand to the other; they were not limited in location to the muscles 
involved in the original practice. This effect has been repeatedly confirmed 
(see Rockway, 1953), and it must be seriously considered in the theoretical 
issues concerning the distribution effects. 

Theories of distributed practice. Current attempts to account for the 
comparative effects of massed and distributed practice for the most part 
have emphasized the role of inhibition from massed practice. Some people 
have argued that the advantage from distributed practice is mainly the re 
sult of rehearsal during the rest intervals, but the evidence is against this 
notion on a number of counts. First of all, many experiments have been 
controlled for possible rehearsal, and these have still found superiority in dis- 
tributed practice. Secondly the effect is more stable and larger in magnitude 
with psychomotor skills than with verbal material, and, intuitively, it woul 
seem that verbal material would be easier to rehearse away from the testing 
situation than psychomotor skills. Finally, the effect also occurs in animal 
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learning (Bunch and Magsdick, 1933; Holland, 1953), where rehearsal 
is improbable. 

We have already had occasion to examine an important theory about 
inhibition. In the chapter on extinction, we examined a theory of extinction 
that attempted to account for the experimental evidence on the basis of two 
inhibitory states. One inhibitory state, reactive inhibition, is the simple re- 
sult of responding. It builds up through continued responding and disap- 
Pears through rest. A second inhibitory state, conditioned inhibition, de- 
pends upon the first; the organism learns not to respond (or, perhaps, not 
to respond well) as the result of being induced not to respond through 
reactive inhibition. This notion, originally advanced by Hull (1943), has 
been modified and extended by several investigators. It was originally put 
forward to account for some of the basic effects in experimental extinction. 
As we have seen, it was only partially successful in this context; something 
like the notion of reactive inhibition seems to have survived experimental 
test—there does seem to be some temporary inhibition to further respond- 
ing generated by behavior—but the concept of conditioned inhibition did 
not survive the experimental onslaught in the study of extinction. 

Since, as we have just seen, the decremental effects of massed practice 
Seem to result in both a temporary inhibition (hence recovery as it is shown 
by reminiscence) and a relatively permanent decrement, several investi- 
ators have assumed reactive and conditioned inhibition to be operating; 
they have tried to find techniques to evaluate unequivocally the role of these 
inhibitions in the distribution of practice phenomenon (Ammons, 1947; 
Kimble, 19492). The logic of the technique for assessing the role of these 
two inhibitory factors is straightforward. Two groups of subjects are com- 
Pared, one group working under massed and the other under distributed 
Practice. The difference in performance between the two groups at any 
Point represents the sum of reactive and conditioned inhibition. At some 
Point a rest period is introduced (to allow reactive inhibition to disappear); 
the gain in performance after rest represents the amount of reactive inhibi- 
tion that has disappeared, and the difference, after the gain, between the 
Massed and distributed group represents conditioned inhibition. 

Despite the simplicity of this analysis, it has many pitfalls. First of all, 
it assumes that all the reactive inhibition dissipates during rest. Secondly, it 
Assumes that practice in the distributed group was so well separated that 
10 inhibition occurred. Thirdly, it assumes that despite a difference in per- 
formance the two groups learned equally (it would not do to have a 
difference in habit strength as well as in inhibition between the groups). 
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Finally, it does not take direct account of the warm-up that must occur 
after a rest. All in all, this is a rather forbidding list of assumptions; so 
we see that the simplicity of the analysis is more apparent than real. 

Some investigators have tried to control or analyze the factors going 
into these assumptions (Wasserman, 1951), but the results have not al- 
ways been convincing. Therefore an exact test of the application of formal 
inhibitory constructs to the massed and distributed-practice phenomenon 
has not yet occurred. Furthermore, as Underwood pointed out in his previ- 
ously mentioned series of papers on distribution in verbal learning, a whole 
host of detailed experimental results cannot be accounted for by these 
notions. 

At the most general level, we can say that there appear to be at least two 
components to the distributed practice effect, a permanent and a temporary 
one. Furthermore, neither of these seems to be localized in peripheral struc- 
tures; they are probably central in origin. Beyond this we cannot specify. 
It is likely, however, that some of the difference between massed and dis- 
tributed practice in recall, even when the recall is separated in time from 
original practice, reflects an underlying difference in what has been learned. 
In an earlier section we saw that for most complicated tasks, doing is im- 
portant to learning. Therefore if the decrement produced by massed prac- 
tice is severe enough to prevent individuals from performing adequately, 
it is likely that they will not learn adequately. Even so general a notion as 
this does not allow us to understand in detail some of the effects of distri- 
bution. For example, Underwood (1953a) points out that in rote verbal 
learning by serial anticipation, items anticipated frequently during learning 
are recalled better after massed practice, while items infrequently antici- 
pated during learning are recalled better after distributed practice. This 
suggests some kind of interaction between distribution and rate of for- 
getting. If this is so, then the Principles governing the permanent effects of 
the distribution of practice for material that is easily forgotten will be 
different from those governing the same effects for material less easily for- 
gotten. Again, this interaction may well account for the greater consistency 
of findings on the effects of distributed practice on psychomotor skills, 
since these skills are about equally easy to remember. 

The earlier theories and to a certain extent the current theories of the 
distribution of practice are based upon inadequate empirical exploration- 
This has not been because there has been a lack of experimental investiga" 
tion but rather because the problems are complicated. The future course 
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of theoretical analysis of this problem will require a further systematic in- 
vestigation of the principal variables. 


Information and Effect in the Learning of Skills 


There is a famous demonstration by Thorndike (1932b) of the im- 
portant fact that the consequences, or aftereffects, of an act are important 
determiners of the future course of that act. If a blindfolded person is re- 
quested to draw, with one quick move of a pencil, a line 3 inches long, his 
accuracy will be poor. If he continues to “practice” without being able to 
see what he is doing, his performance will not materially increase in ac- 
curacy, though it may become less variable. However, if an experimenter 
measures the drawn line and says “right” if the line is within ¥8 inch of 3 
inches and “wrong” if the line deviates by a greater amount, the subject 
will rapidly show improvement in his ability to approximate the 3-inch 
Criterion. 

This demonstration has been the subject of an enormous amount of 
mMisinterpretation. It is meant to show the importance of the aftereffects of 
a response in determining its direction of change. By implication it is fre- 
quently taken to mean that the reward (“right”) satisfies the subject's mo- 
tivation to increase his accuracy and that this has something to do with the 
improvement. To argue whether or not this is so easily leads us into a 
morass of slippery concepts. It may well be true (depending upon how 
Systematically defined our notions of satisfaction and motive are), but 
there is no critical reason why it is necessary to describe this demonstration 
in these terms. The line-drawing demonstration shows only the operation 
Of the empirical principle of reinforcement, which says that there is a class 
Of stimuli to selectively strengthen every response. 

“Right” as a reinforcement is trans-situational (see Chapter 2); that is to 
Say, it will strengthen responses of a wide variety of classes. Almost any 
Sort of information that leads the human learner to make what the experi- 
menter or teacher defines as a correct response is a reinforcement, though 
some such informational reinforcements are clearly not trans-situational. 

Knowledge of results in a training program. A good example of the 
difficulties involved in getting the correct information to the learner is found 
in flexible-gunnery training. In firing guns it is sometimes difficult for the 
trainee to know if and when he hits the target. During the Second World 


War various training devices were developed to give individuals information 


about their accuracy in firing. One of these, the Waller trainer, was a screen 
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Which simulated the sky and against which images of planes were thrown; 
a simulated gun and sights enabled the trainees to fire at the images. The 
apparatus was rigged so that it would score a “hit” whenever the individual 
had the sights lined upon the image of the plane and was pulling the trigger. 
In addition, there was a tone—a “beep”— which was sounded whenever 
the individual made a hit. This could be thrown in or out of the circuit so 
that the effect of this piece of information could be studied. Figure 41 tells 
the story. It is obvious that when the tone was sounded for a hit, the sub- 
jects did much better. When the tone was sounded early in practice it re- 
sulted in a greater number of hits. Later, when the tone was withdrawn from 
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Figure 41. The effect of knowledge of results upon learning to hit a target. Whenever a hit was 
made, the tone was sounded. One group received the tone at first and not later. The other group 
received no tone at first but then heard one on the later trials. (Hobbs, 1947.) 


this group and given to the second group, the second group immediately 
increased in number of hits, so that it now Surpassed the first group. Unfor- 
tunately, the investigators did not include a third group that was never 
given the tone. 

The same group of studies (Hobbs, 1947) shows that where there was 
little opportunity for knowledge of results, as in the gun camera mounted 
in the tail of a bomber, relatively little improvement in score took place. 
This fact ruled out several otherwise excellent, realistic training devices. A 
second important point is made by the same studies: Sometimes the subject 
gets the wrong information. Knowledge of results of training with a sta- 
tionary gunsight and target might be misleading, because in actual prac- 
tice the gunner must fire at a moving target from a moving plane and must 
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learn to “lead.” Thus knowledge of results on a training device could actu- 
ally lead to negative transfer on the Job. While this may not happen often, 
it is Something which needs to be watched very carefully in any training 
Program. 

Delay of information and learning. In examining the influence of rein- 
forcement upon animal behavior we saw that a delay in reinforcement may 
disrupt behavior and slow down the rate of learning. Is the same thing true 
Of Teward and information in adult human learning? At first thought we 
should Probably say no, because the symbolic processes in man would 
enable him to bridge anything but an enormous delay between the occur- 
Tence of a response and the reward or information that comes as its conse- 
quence. Let us, however, examine some of the experimental evidence. 

In one experiment (Lorge and Thorndike, 1935) subjects learned to 
toss a ball at a target they could not see. Some subjects were told how well 
they did on a particular throw immediately after the throw; for other sub- 
Jects the information was delayed for various short intervals of time. In 
Seneral, as one might expect, there was little or no difference due to Vary- 
ing intervals of delay in the learning of this task. If, however, the interval 
between throwing the ball and the information about that throw was filled 
With another throw, the gain in accuracy was impaired. This result suggests 
thatit is more than simply the passage of time that is important in the effects 
of delay upon information and reward (we concluded much the same thing 
in the case of delay of reinforcement on animal behavior). f 

Saltzman (1951) has demonstrated that a brief delay in reward and 
information in rote verbal learning can have considerable effects upon the 
umber of errors made by subjects during learning. A delay as short as six 
Seconds increased the number of errors by 50 per EES points 
Out that his subjects rehearsed the response between the i 
And the delay of information, and it is quite possible that “ PE CE 
Vance of knowledge of the adequacy of the কা ্‌ may 3 
Slower learning. A series of systematic experiments on the influence of suc 
factors i | l 

Monn orf and information. It is quite ee ER 
about the adequacy of his response allows a learner toa h 1s perform- 
Ance more nearly to approximate a norm. Thorndike CE oo 
however, that this was not the only way to look at reward and information. 
He believed that rewards had automatic effects on that LE over 
And above any deliberate use of information on the learner’s 0 rewards 
“Stamped in” preceding responses or stimulus-response connections. 


200 THE PSYCHOLOGY OF LEARNING 


A series of experiments on a phenomenon called the spread of effect con- 
vinced Thorndike that rewards acted in this automatic, blind fashion. 
Thorndike’s (1933) original experiments seemed to show that the conse- 
quences of a response (reward or punishment) were not specific to the 
response which they followed but spread more or less automatically to re- 
sponses temporally adjacent to that response. 

The experimental design which Thorndike used was rather unusual. 
Subjects are given a series of items to which a number of responses are 
possible, as in a multiple-choice test. The subjects are instructed to choose 
a response from those available on each trial. Since the subject does not 
know which responses are supposed to be associated with which word, his 
responses at the beginning should be fairly random. In one experiment, for 
example, the subject is read a long list of words. He responds to each word 
by guessing a number from 1 to 10. The experimenter follows the subject's 
number with “right,” “wrong,” or nothing at all. The subject believes that 
he must learn to associate the right number with the right word. The ex- 
perimenter, however, has decided beforehand which of the words he will 
say “correct” to, no matter what the subject gives. The subject is told that 
more than one number might be correct for any word, so that there is no 
way in which he can know that there is nothing for him really to learn. 

The effect Thorndike found was that the statement “correct” not only 
increased the probability of repeating the response that it followed, but it 
increased the probability of repeating the responses before and after the 
response rewarded. This you can see in a typical spread-of-effect gradient in 
Figure 42. The responses on either side of the middle response had nothing 
following them; however, they did increase in per cent repetition over 
chance expectancy. 

At the time Thorndike’s results were published, they did indeed seem 
to give strong support to the notion that the consequences of a response 
operated by the principle of effect as well as by information. In the inter- 
vening years, however, a number of complications have been found in this 
kind of experiment. First of all, it appears that it is not a stimulus-response 
connection that is strengthened; it is just the habit of guessing one particular 
number after some other number (Zirkle, 1946). Even when the order of 
items given to the subject is changed on every trial, the spread of effect 
remains. 

A number of other experiments combine to show almost overwhelmingly 
that the spread of effect in this type of experiment is primarily due to the 
fact that human beings seldom respond randomly, and the effect does not 
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seem to be related to the principle of effect at all. These experiments show 
that the guessing habits of subjects are the important determiners of the 
effect. When no opportunity for guessing habits is permitted, no spread of 
effect appears (Jenkins and Sheffield, 1946; Jenkins and Postman, 1948). 
On the other hand, Sheffield (1949) has shown that pure guessing without 
any information or reinforcement will result in an aftergradient like that 
found in the spread of effect. So it seems that the dependent probabilities 
in human verbal behavior provide the background for the spread of effect 
(Smith, 1949). Finally, Sheffield and Jenkins (1952) have shown that the 
more closely an experiment approximates Thorndike’s original one without 
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Figure 42, The spread-of-effect phenomenon. The probobility of 
item in a series is increased, as illustrated in this hypothetical example. 
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appear to be reasonable under certain conditions, but it is not the same as 


Spread of effect, and it adds nothing to the notion of the automatic strength- 
ening effect of Heard. Rather a reward appears to function (1) to keep 
activity Eoing (as it does for animal behavior) and (2) to provide informa- 
tion about the adequacy of past behavior and the direction future change of 


havior should take. 
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Other Conditions of Practice 


Of course there are a large number of factors that determine the efficiency 
of learning for particular tasks in formal training programs or in schooling. 
Sometimes these factors are so highly specific to particular tasks that they 
are of little general interest in the Psychology of learning, and sometimes 
they are unsystematic arrangements of more basic variables. For example, 
consider the problem of active participation in learning. 

It is no surprise to anyone that active Participation in a task yields supe- 
Tior performance, recall, or transfer for that task. For example, interruption 
of an educational film for multiple-choice questions increases the effective- 
ness of the film as a teaching device (Kurtz, Walter, and Brenner, 1950), 
though perhaps, rather surprisingly, no more so than simply repeating the 
film a second time. Obviously, the questions provide more active participa- 
tion on the part of the viewers than does merely watching the film. 

When we ask why active participation is more effective than passive 
participation, we immediately see that it is a complicated question. Par- 
ticipation varies for different tasks, and it must have somewhat different 
effects. It may supply the learner with knowledge of results with short 
delay, increase the learner’s motivation, or help him eliminate errors early 
in practice. 

What is surprising, perhaps, is that retention of verbal material is better 
When time is taken away from reading or study of the material and put into 
active recitation. In one study (Gates, 1917) subjects who spent up to four- 
fifths of their total study time in recitation rather than in reading showed 
the greatest improvement. The advantage gained from recitation is fre- 
quently large, and this is one of the points about habits of study that is 
emphasized in manuals on study habits (see Robinson, 1946). Very likely 
recitation is important to E0Od study habits because it provides direction 
to the learner's work and readily supplies information or knowledge of re- 
sults about performance. 

Another problem that sometimes arises in application of principles of 
learning to formal training or schooling is the question of whole and part 
learning. This originally arose in the context of rote memorization, where 
perhaps it makes most sense, and it was concerned with whether it is more 
efficient to memorize by reading the material from beginning to end each 
time or by breaking the material into its constituent parts and studying each 
separately. 

A little reflection will show that even in the limited context of rote 
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memorization, the question of whole and part learning is complicated. What, 
for example, is the role of length of task? Perhaps for short tasks, such as 
memorizing a sonnet, the whole method would be better, while for longer 
tasks, memorizing a book from Paradise Lost, for example, the part method 
might be more efficient. Unfortunately we do not have Systematic informa- 
tion to answer the question about the relative efficiency of whole and part 
learning for all tasks. We cannot even make a reasonably sure guess, as we 
might in the case of massed versus distributed practice. We do know that 
Some conditions favor whole learning and others favor part (Jensen and 
Lemoire, 1937), but we do not know enough about the interaction of this 
Problem with other methods of practice to make clear recommendations 


about a wide variety of tasks. 


TASK VARIABLES 


Let us now consider how some of the important differences between 
Various tasks affect learning. No task that a human being learns is quite 
like any other. Some of the differences between tasks are trivial and others 
Are of limited general interest. There are, however, a number of ways in 
Which tasks differ that are fundamental to the psychology of learning. 

First of all there is the important difference between those tasks that are 
Primarily verbal and those that are not. Then there is a dimension of per- 
ceptual emphasis along which tasks vary; some are primarily a matter of 
leaning to make discriminations and to form new perceptual organizations. 
Other tasks are primarily a matter of adjusting motor performance to meet 
Some norm of skill. Still others are matters of conceptual organization. In 
tasks that demand simple memory, there is an important dimension that 
“uns from demanding memorization of a collection of unstructured elements 
Fl demanding memorization of highly organized and meaningful material. 
Compare, for example, an attempt to memorize a list of 100 nonsense sylla- 
bles with an attempt to memorize 100 successive words from a story in a 
Popular maoazi 

Then hee two important dimensions which run through prac- 
tically all types of tasks—difficulty and length, or amount, of material to 
5 learned. Though we regard these as fundamental dimensions, they are 
not Necessarily pure. Task difficulty is frequently correlated with ad 
Way in which tasks vary (meaningfulness, for example), and it is I 
Means a simple variable. As a matter of fact, such variables as task diffi- 
culty are Probably only rough empirical classes, and the future development 
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of learning theory may enable us to dispense with such notions altogether. 
For now, however, a considerable part of our understanding of the basic 
mechanisms in the psychology of learning is tied up with such variables as 
difficulty and amount, and we must regard them carefully. 


Difficulty 


A difficult task is generally one that takes us a relatively long time to 
learn. Yet, it is not possible to equate difficulty with the length of time or 
the number of trials it takes for a learner to reach some arbitrary criterion, 
since other conditions influence the length of time spent in learning. The 
abilities of the learner, his motivation, the amount of preceding practice, 
and the distribution of work are all factors that will influence the length of 
time it takes to reach some criterion of performance. We can, however, 
define a change in task difficulty when we keep all conditions, such as distri- 
bution of work, motivation, ability, constant and vary some aspect of the 
task in such a way that the time taken to reach a criterion is changed. 


Difficulty and Response Characteristics 


Number of available responses in trial-and-error learning. Trial-and- 
error learning generally refers to those situations in which learners cannot 
make use of a general principle or of previous knowledge, and in which they 
must try to find the correct solution by exploring the available alternatives. 
In such learning an important determiner of difficulty is the number of 
choices the learner is given (Noble, 1956). The larger the number of al- 
ternatives (assuming the number of correct alternatives remains the same), 
the greater the difficulty of the task. For example, in one experiment 
(Noble, 1955) subjects had to learn to push four buttons in a particular 
sequence; in this situation the number of trials required to reach an arbi- 
trary criterion was enormously increased when the number of available 
buttons increased. If only four buttons were present, they were all relevant 
to the task, but when the buttons were increased to 10 in number, 6 of them 
were irrelevant and the subjects had to learn to eliminate these. 

Interaction between number of available responses and task length. 
The influence of number of alternatives on performance is modified by the 
length of the task. This is demonstrated in a series of studies by Brogden 
and Schmidt (1954a, 1954b). These investigators studied the effects of 
increasing both the number of response alternatives available at each choice 
point and the total number of choice points. They used a verbal maze, one 
in which the learner must find his way by guessing which of a number of 
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available responses at each choice point will allow him to go on to the next 
choice point and eventually the end of the maze. In their experiments, Brog- 
den and Schmidt studied two different lengths of maze, one 16 units long 
and another 24 units long. 

The results of this study can be seen in Figure 43. Both the number 
of errors and the time taken to reach a criterion score increased linearly 
With an increase in the number of alternatives. It is interesting to note, how- 
ever, that for the two mazes of different length, the rate of increase in time 
and errors with increasing number of alternatives is different. Apparently, 
the longer the maze the greater the difficulty added by increasing the num- 
ber of alternatives at each choice. A careful examination of the curves in 
Figure 43 will reveal that the increase is disproportionate in the case of the 
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Figure 43. Curves showing the increase in errors and time to criterion with an increcse in the 
number of choices at each choice point in verbal mazes. The dotted curves are for mazes 16 units 


long and the solid ones for mazes 24 units long. (Brogden and Schmidt, 1954b.) 


r time and greater number of errors 
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Onger maze. In other words, the greate 
are more 


to criterion is not simply accounted for by the fact that there e 
Choices to be made; increasing the number of choices and increasing the 
length of the maze seem to have different effects. | 
In these studies one important feature turned up that is of critical interest. 
1 general, increasing the number of alternatives at each choice did not in- 
Crease the number of trials required to criterion. Thus, while the total number 
of errors made by subjects and the total time to criterion were increased by 
adding to the number of choices, the subjects did not increase the number 
oF times they had to go through the maze in order to reach criterion. This 
tS in contrast with the results found by Noble (1955), and the answer lies 
“1 the fact that Noble’s definition of trials differs from that of Brogden and 
chmidt, In Noble’s study trials means the number of guesses the subject 
Makes; Brogden and Schmidt define trials as the number of times the sub- 
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ject had to go through the different choice points. Miller (1956a) has 
pointed out that the reason trials to criterion was not influenced in Brogden 
and Schmidt’s study was that subjects could organize the responses available 
so that they did not need to take into account all the possibilities. Once the 
correct choice is found, memory for this choice is not influenced by whether 
there are 4 choices or 10. What does make a difference, as we shall shortly 
see, is the total number of things it is necessary to remember. The number of 
alternatives will influence the total number of errors subjects make (and 
therefore the total time, since errors take time), because the greater the num- 
ber of alternatives the lower the probability of making a correct guess. If I ask 
someone to guess in which hand I have a penny, he has a better chance of 
guessing correctly than if I had asked which of eight pockets on my person 
contains a penny. Since, in Brogden and Schmidt’s study, the subject must 
eventually guess the correct alternative before going on to the next choice 
point, the total number of times the subject goes through the sequence of 
choice points is unaffected. 

Therefore, while increasing the number of alternatives will increase the 
number of errors made during learning, it will not increase the difficulty of 
remembering the correct response. 


Difficulty and Characteristics of the Material Learned 


The relationship of characteristics of a task to be learned and difficulty 
of learning is a fairly broad question. We can ask, for example, What is 
more difficult to learn, typing or driving an automobile? Such a question 
would have little general value, since these two tasks differ from one 
another in a large number of characteristics. If, however, we compare tasks 
which differ in only one characteristic we can learn something about how 
variation contributes to the difficulty of tasks. 

Association value. One important characteristic of verbal material is 
its richness of contextual association. Some words have a greater potential 
for eliciting associations than other words. Even nonsense words or syllables 
differ enormously in their potential for eliciting associations. Various lists 
of nonsense syllables have been carefully examined for their association 
value, and it is possible to choose syllables of different value for different 
kinds of experimental studies. 

Noble (1952a) has made use of an index of meaning applied to two- 
syllable nonsense and ordinary English words. This index was obtained by 
counting the average number of associations given during a 60-second 
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period to each word. The obtained index ranged from 0.99 for the word 
GOJEY to 9.61 for the word KITCHEN. It is interesting to note that such a 
nonsense word as ROMPIN has a higher derived index of meaning than such 
infrequent but real English words as ICON, MATRIX, and BODKIN. Noble 
(1952b) used these words in a study of rote learning by serial anticipation. 
It turns out that a list constructed of words with an average meaning index 
of 1.28 took almost three times as many trials, on the average, to learn as 
a list constructed of words with an average meaning index of 7.85. These 
results, taken together with earlier studies of three-letter nonsense syllables 
(McGeoch, 1930), make it clear that associative potential of individual 
Verbal items is an important determiner of ease of learning. It is worth 
noting that associative potential plays essentially the same role in paired- 
Associate learning (Noble and McNeely, 1957). 
Similarity. A variable of considerable importance from a theoretical point 
Of View is that of similarity among items in the material to be learned. 
Similarity, as we shall see in later chapters, is a rather complicated dimen- 
Sion. Any task consists of stimulus and response components. Both may 
Vary in similarity among the separate items and both may vary with respect 
to each other. In rote serial learning the same item serves as both stimulus 
and response in succession, while in paired-associate learning there is a 
Clearer separation between stimulus and response elements. Therefore, we 
Might expect some differences to exist between the effects of variation in 
Stimulus similarity and in response similarity. Hence, tasks in which there 
Are different relationships between the items must be considered separately. 
+ If we consider the rote serial learning of verbal items we see that an 
‘crease in the similarity between items generally results in an increase in 
he number of trials to criterion. For example, Underwood and Goad 
(1951) showed that lists composed of adjectives highly similar to one 
another, such as “elated,” “gleeful,” “carefree,” took ECE He io leam 
sed of unrelated adjectives like “worldly,” “fiery, 


« blotchy» The same thing is apparently true of nonsense syllables, in which 
[ Similarity is achieved, not by meaning, but by CUPLCarion of letters 
azarus, Deese, and Hamilton, 19 54; Underwood and Richardson, 1956). 
Underwood (1951) has also studied the influence of similarity upon 
Pairedassociate learning. In his study he compared learning of AD So 
Posed of distinctly different adjectives with learning of lists in which similar 
general, the more similar 
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Underwood did not conduct studies in which he varied stimulus similarity 
while holding response similarity constant or varied response similarity 
while holding stimulus similarity constant. From other studies, some of 
which we shall examine later, we can conclude that probably both stimulus 
similarity and response similarity increase the difficulty of a paired-associate 
task. High stimulus similarity produces high stimulus generalization from 
one item to another; since the stimulus items are less distinctive, they should 
be harder to discriminate. Likewise, if response items are highly similar, 
learners should have a harder time in finding and remembering their dis- 
tinctive properties. 

In general, it is probably safe to conclude that within a given task, high 
similarity of stimulus and/or response components increases the difficulty 
of that task. We should note, however, that this is only necessarily true 
where the learner is required to discriminate among all the components of 
the task. If, for example, any one of several responses will do in response to 
a given stimulus, response similarity could well help rather than retard 
learning. This case illustrates the essence of the difference between negative 
and positive transfer, and we shall have many opportunities to examine such 
cases in the next few chapters. 


Amount to Be Learned 


We hardly need an experimental study to tell us that the more material 
a subject has to learn, the longer the time required, or indeed, that a greater 
amount to be learned requires more trials for learning. The exact nature 
of the relationships between time and trials necessary for learning and 
amount of material to be learned is interesting. It is also somewhat sur- 
prising, and it will repay us to examine it carefully. A classical monograph 
by Lyon (1917) summarizes much of the best available evidence on these 
relationships, and we shall draw heavily on Lyon. 


Amount to Be Learned and Time per Item 


Nonsense syllables. Perhaps the most meaningful way to present the 
relationship between amount to be learned and time to learn is not to com- 
pare the total learning time with the number of items for different numbers 
of items, but to see how length of time to learn increases as items are added. 
Such a comparison can be seen in Figure 44. This figure shows the re- 
sults of two of Lyon’s experiments on the time required to learn lists with 
different numbers of nonsense syllables. In one experiment Lyon learned 
nonsense syllables by continuous practice, and in the other by repeating 
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the list only once each day. The curves show the time per syllable as a 
function of number of different syllables. As we can easily see, the time 
Per syllable increases rapidly with continuous practice. We can imagine, 
from our earlier discussion, that this very rapid increase in time per syllable 
Probably represents something more than the inherent difficulty of learn- 
Ing longer lists; this may represent the cumulation of response-produced 
inhibition as well. So, for our present purposes the curve for one trial 
Per day is probably more meaningful. It is rather surprising that while 
there is an increase in the time per syllable with longer lists, it is not a very 
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the immediate memory span.> Such a conclusion is in accord with the data 
shown in Figure 44 for the once per day method as well as data from other, 
earlier investigators. 

Meaningful material. Lyon also studied the amount of time that it took 
him to memorize poetry. The results with poetry were much like those he 
found with nonsense syllables. While additional material did require an 
additional amount of time per item, each item took less and less additional 
time. The items in this case were stanzas of poetry. Of course, the material 
is much longer, but each syllable of the poetry took less time to memorize 
than each individual nonsense syllable. 


from the simple sequence 3-2-1-4-3-2. Now all meaningful material is to 
Some extent redundant; therefore the additional time per comparable item 


should be less for meaningful material than for some nonredundant ma- 
terial like random lists of nonsense syllables. 


Amount to Be Learned and Number of Repetitions 


Nonsense syllables. If the exposure time of each item in a verbal learn- 
ing experiment is carefully controlled, the number of repetitions would be 
exactly proportional to the time. In Lyon’s experiments, however, he did 
not control exposure time, so that the correlation between time of learning 
and number of items to be learned is not perfect. Indeed, when we look at 
the results of Lyon’s observations on the amount to be learned and number 
of repetitions shown in Figure 45 we see that number of repetitions and 
time per syllable behave very differently. The curve in Figure 45 shows the 
number of repetitions, or trials, per syllable necessary for learning from 
the same study as the distributed practice example in Figure 44. The num- 
ber of repetitions required per syllable increases rapidly at first, but then; 


2 The immediate memory span is the number of items that can be immediately re 
called after only one trial. 
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rather Surprisingly, it reaches a maximum and begins to decline. The result 
1s that Lyon required fewer repetitions per syllable to learn a list of 200 
TOnsense syllables than he required to learn a list of 16 nonsense syllables 
(a little multiplication will show, of course, that the total number of repeti- 
tions was greater in the case of the 200 syllable list). These results are not 
Peculiar to Lyon as a learner; in his monograph he summarizes results of 
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L Figure 45. Number of repetitions per syllable required to learn lists of nonsense syllables of vary- 


ng lengths bythe. once per day method. This curve is based upon the same study illustrated in 


Figu, 
"e 44. (Dota from Lyon, 1917.) 


PXperiments by Ebbinghaus and Meumann which show essentially the same 


‘1g, though not in so dramatic a form as Figure 45. 
a hus it Seems probable that the number of repetitions does not increase 
With the number of syllables beyond some relatively small number, and 
that it may actually decrease, especially if the exposure time is uncontrolled 
AS it Usually was in these early experiments. What if the exposure period is 
Carefuly controlled? Unfortunately, we do not know what happens, since 
yee are no data over a range of material comparable to that investigated 
Y Lyon in Which exposure time per item is controlled. These experiments 
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are difficult and tedious, and no modern investigator has tackled them on 
the scale that Lyon did. It is quite possible that repetitions per syllable 
would not decrease if this were done, but a comparison of Figures 44 and 
45 suggests that it is unlikely that the repetitions per syllable would actually 
increase much with the number of syllables. The net result of these studies 
is to make it probable that the increase in amount of work (time or trials) 
to criterion is not disproportionate to an increase in amount of material, 
except possibly in the transition from the immediate memory span to 
longer lists. 

Meaningful material. Cofer (1941) has studied the relationship be- 
tween amount of material and number of trials to learn with meaningful 
verbal material. As in the case of Lyon’s experiments Cofer did not control 
the exposure time; consequently we should expect his results to be much 
like Lyon’s. And in fact they are, when the differences produced by Cofer’s 
more unpracticed subjects, etc., are taken into account. There is an abso- 
lute increase in the number of trials required to reach criterion as more 
material is added, but the number of trials per unit word actually decreases 
with longer lists. This incidentally is true whether one considers number of 
trials per unit word, per unit sentence, or per unit significant idea, and it 
is true whether instructions emphasize verbatim (rote) learning or logical 
(ideational) learning. Of course, the absolute number of trials per word is 
much smaller than in the case of Lyon’s experiments on nonsense syllables, 
even though Cofer’s subjects were relatively unpracticed. This is, of courses 
because of the higher redundancy in meaningful material. 

The results of the available experimental evidence clearly suggest that 
amount of material does not necessarily disproportionately increase difli- 
culty of learning as has commonly been supposed both by learning theorists 
and those only casually interested in learning. This conclusion, however, is 
limited by the word “necessarily” because many of the important experi 
mental parameters have not been thoroughly explored. It is possible, for 
example, that material with high similarity between items would result not 
only in an over-all greater number of repetitions per syllable, but repetitions 
may be disproportionately greater for longer lists composed of such maz 
terial. This whole question of the relationship between trials to criterion 
and amount of material is ripe for exploration, particularly since there isa 
very real possibility that the results of such exploration may lend themselves 
readily to interpretation in terms of measures of information. 


CHAPTER 9 


TRANSFER OF TRAINING 


hole of the psychology of learning 


There is no more important topic in the w 
nows that transfer of training 


than transfer of training. Nearly everyone k : 
is basic to educational theory. Practically all educational and training pro- 


rams are built upon the fundamental premise that human beings have the. 
ability to transfer what they have learned from one situation to another. 
This assumption is evident in things as different as the curriculum of a col- 
lege of classical tradition and an adolescent who is persuaded by his mother 
to attend dancing school in the fond hope that training in the graces of ball- 
Toom dancing will permit him to navigate the length of the house without 


danger to less sturdy furniture. : f 

The problem of transfer of training arose in the LL of educational 
theory, but it has always been fundamental to psychological theory as well. 
Indeed, the basic psychological problems in the transfer of training pervade 
the Whole of the psychology of human learning, and in one sense the major- 
tly of the topics this book examines are problems in transfer. Thus, the 
Stimulus generalization of a conditioned respors? the chaining of responses 
'n maze learning, and the retention of verbal learning all involve special 
Cases of transfer of training. _ k 

In this chapter we shall examine some major techniques for the study of 
transfer of training as a problem in itself, some conditions that lead to posi- 
live transfer between tasks, and som 


0 
f the study of transfer. 


e theoretical principles that arise put 


THE MEASUREMENT OF TRANSFER 


ning or performance on one task has 
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in the study of transfer of training is to compare the performance on a par- 
ticular task of two groups of subjects, one of which has practiced on another 
task. There are a number of ways in which such an experiment can be per- 
formed and the comparison evaluated. These will be discussed in the follow- 
ing section. 


Experimental Designs in the Study of Transfer 


The proactive design. The proactive design is the simplest arrangement 
in the study of transfer. Basically, there are two groups of subjects, one of 
which practices a preliminary task and both of which then are tested on a 
subsequent task. Thus the basic conditions can be summarized as follows: 
Experimental group 

Learns task A 
Control group 
RES syns aes EEE EIEIO G EEE HAUSA EEE Test on task B 

If task A has had an effect upon task B, these two groups should perform 
differently on the test of task B. If the experimental group performs signifi- 
cantly better than the control group on task B, we can say that proactive 
facilitation, or more generally positive transfer, has taken place. If, on the 
other hand, the experimental group performs significantly worse on task B, 
we can say that proactive inhibition, or negative transfer, has taken place. 
If there is no difference between the two groups on task B, of course, nO 
transfer has taken place. 

It takes at least two groups of subjects to perform a transfer experiment 
of this type. For many purposes, however, we may wish to increase the 
number of comparisons. We may have, for example, several variations in 
task A. In such a case we should want to compare each of the variations 
in task A with a control or rest condition. Furthermore, we may want to 
compare the amount of transfer between task A and B with the effect of 
direct preliminary practice on task B. Thus a more complicated proactive 
experimental design might be as follows: 

GLHOD EL LEMS EHEEB i i saa NaI LEER EIEE EE AEES Test on task B 
GIOUD 2: LEASE AY wussstnassnssetansnaseawans Test on task B 
GLUON SE LEASE Ay sean tvaas ESE Resa Test on task B 
SHON CE RESIS ice wile Sas FUE EEFLILE wii Test on task B 

It is surprising to find out that there are circumstances when transfer 
from one task to another is greater than the advantages of direct practice 
on the task to be tested. We shall examine some of the circumstances sur- 
rounding this unusual case later on. 


PET CT TPE EOE EEE UNIO: Test on task B 
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One further variation in the proactive design provides for a retention test 
for the second task. In the simplest example of this design, one group learns 
task A, then both practice on task B, and finally both are tested for reten- 
tion of task B. The design is as follows: 

Experimental group 
Learns task A..... 


Control group 
RESIS iets ema Learns task B..... Tested for retention on task B 


This particular design is not so important to the theory of transfer of 
training itself. It is, however, fundamental in the application of the theory of 
transfer to an understanding of the forgetting process, and in the next 
Chapter we shall have ample opportunity to see examples of the use of this 


design. 
The retroactive design. The other commonly employed experimental 


Plan for the study of transfer is known as the retroaction experiment. In 
this design, an experimental group practices first on task A, then on task B, 
and finally on task A again. The control group practices only on task A 
for a time comparable to the experimental group. Thus the experimental 


Learns task B..... Tested for retention on task B 


design is as follows: 
Experimental group 
Leams task A... wees 


Control group Test on task A 
LEAS THSEA. > 2 ists Rit ex 24.75 #4 sett as 


The critical comparison is on the final test of task A. For readily apparent 
reasons, this design is sometimes called the “fore-and-after” method. It has 


the advantage that the initial practice on task A can serve to equate the two 
d thus reduce extraneous variability 


Learns task B...s oes es Test on task A 


Sroups of subjects in performance anc ; vari 
On the final test. Furthermore the design enables one to examine directly 


the influence of learning task B upon the retention of task A, and this is 
5 
Sometimes very important. ্ 
The control group in proaction and retroaction experiments. In the 
ideal experiment the control subjects would be held in absolute inactivity, 
SO that the resulting differences between the experimental and control con- 


ditions could be entirely attributed to the task assigned the experimental 
Sroup. In practice, of course, it is impossible to keep human or animal sub- 
ctive. This seldom is a serious matter with animal sub- 
e we have good grounds for supposing that they are 
ks they are to learn when they 


it is unlikely that animals will 


lects completely ina 
Jects, however, sinc 
Mot engaging in activity relevant to the tas 
are away from the testing situation. Thus, 
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rehearse task A in a retroaction design. It is possible, however, that human 
subjects will rehearse or otherwise engage in activity appropriate to the 
tasks they are required to learn in the experiment. Consequently, in most 
transfer experiments with human subjects, the control group is not kept 
completely inactive. Rather it is usually given some task to perform in the 
“rest” condition that keeps it occupied; this task is one that we know from 
previous work will have little or no effect upon the task in which we are 
testing for transfer. This is particularly important in studies of retention, for 
which rehearsal outside of the experimental learning situation would have 
serious consequences for the outcome of the experiment. 


The Evaluation of Transfer ‘ 


It is obvious that one evaluates the amount of transfer by comparing the 
performance of a control and experimental group of subjects. There are 
many different ways in which this comparison can be made, however, and 
sometimes they lead to quite different conclusions. Therefore the question 
of the evaluation of measures of transfer deserves some attention. 

The ideal transfer measure would be one that could be expressed as a 
percentage in such a way that 0 per cent means no transfer and 100 per cent 
means perfect transfer, or the same performance level that would have been 
achieved if an equivalent amount of practice had been devoted to the task 
on which testing occurs. Gagne, Foster, and Crowley (1948) have given a 
formula which meets these requirements. It is 


(Score, transfer group) — (score, control group) 

(Total possible score) — (score, control group) 2300 

This formula is particularly useful in the proaction design in which there is 
no fore measure on the task receiving the transfer to take into account. 
By this formula, if the score of the transfer group is the same as the con- 
trol group (which had not received practice on another task) there is zero 
transfer. If, however, the transfer group achieves the maximum possible 
score and if the control group does not (as would certainly be the case), 
transfer is 100 per cent. 

This formula cannot be applied in every situation, however. In many ex- 
amples of learning we do not really know what the total possible score is, or 
it is so unrealistic that we may end up with a distorted picture of the 
amount of transfer obtained. Furthermore, in the retroaction design we 
may want to take account of the effects of preliminary practice on the task 
receiving transfer, and this formula ignores such effects. 

Expressing amount of transfer in such a formula is convenient, and it is 
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frequently done. It may however obscure the way in which transter from 
One task to another occurs. It is quite possible, for example, that transfer 
may affect initial level of performance on a task but not rate of learning on 
that task (see Woodworth and Schlosberg, 1954). It is even possible that 
transfer could affect the upper limit of performance without affecting rate 
of learning, though this is unlikely. The point is, however, that expressing 
transfer in terms of some simple formula like the one above would obscure 
Such differential effects of transfer. The particular parameter of the learning 
function that transfer affects may be very important in the practical evalua- 
tion of transfer in a training program (Lawrence, 1954), and we shall have 


More to say about this matter later. 
THE THEORY OF TRANSFER 


The Educational Backgrounds of the Problem of Transfer 


Educational theories which concern transfer of training are older than 


the experimental study of the learning process. One does not need to look 


far to see why transfer of training is something that must be considered in 
any theory of education. In the first place, as we have already mentioned, 
education assumes transfer, for aside from its intrinsic value, there is no 
Point to education apart from transfer. Furthermore,Wwhen the empirical 
Study of the learning of children began, it was apparent that there were 
ut the transfer of training that demanded 
Solution. Consequently, early educational psychologists were much inter- 
ested in transfer; indeed, one could say that transfer of training was the 
Central problem of early educational psychology. Since many of the theories 
and controversies that arose about transfer of training among the early 
educational psychologists are still with us (though usually in more artful 
and sophisticated guises), an examination of these will set the stage for the 


Analytic study of transfer. 
The notion of formal discipline. 


Many practical questions abo 


The idea of formal discipline, for a 


long time, was the béte noire of educational theorists. This was the idea, 
Informally held by many people, that mental exercise is good for the mind. 
he classical preparatory curriculum in education embodies this notion. 
In former days the hapless student studied Greek, rhetoric, geometry, and 
atin, not so much for their intrinsic value but because they exercised and 


Sharpened the “mind.” Learning to conjugate in Latin would make a keen 
Wit as a lawyer, or training in logic would enable the physician to exercise 


the healing arts more effectively. 
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Despite the apparent embodiment of the notion of formal discipline in 
older educational programs, it never really had the support of thoughtful 
men (see Stroud, 1940). The evils of formal discipline, however, became a 
sort of straw man in educational theory, simply because 40 or 50 years ago 
the educators needed a good lever to pry the traditional curriculum loose 
from the schools. 

Early studies of transfer. Early experimental studies of transfer of 
training were in large part directed towards demonstrating the inadequacy 
of the formal discipline theory of transfer—the notion that there would be 
transfer between two tasks to the extent that the first provided good hard 
mental exercise. Consequently there was much early emphasis upon the 
small amounts of transfer that actually took place between the formal aca- 
demic disciplines. For example, Thorndike (1923) showed that there was 
only a slight advantage in the reading of English for students who had pre- 
viously studied Latin. Similar studies (Thorndike and Ruger, 1923; Wilcox, 
1917; Cole, 1924) showed the limited transfer of Latin to such varied as- 
pects of English as reading, grammar, and spelling, as well as to the acquisi- 
tion of modern foreign languages. 

Relatively early, Thorndike and Woodworth (1901) performed a series 
of experiments designed to test theoretically some of the implications of the 
notion of formal discipline. In addition to experiments on classroom leain- 
ing, Thorndike and Woodworth performed studies of transfer in laboratory 
tasks. For example, they practiced their subjects in the estimation of the 
areas of various geometrical figures such as triangles and rectangles. They 
examined the influence of transfer from learning to estimate the area of cer- 
tain specific forms to the ability to estimate the area of other forms. The 
results of this experiment led Thorndike to an “identical-elements” theory 
of transfer. This notion and some that grew up in opposition to it are impor- 
tant, for they are the direct ancestors of our current interpretations of the 
mechanisms in transfer. 

Identical elements versus general principles in transfer. The identical- 
elements theory of transfer says that the specific elements identical to two 
tasks are transferred from one task to the other. Thus, training in addition 
benefits performance in multiplication for the simple reason that many of 
the responses required in addition are the same as those required in multi- 
plication. From the notion of identical elements, then, one would expect the 
amount of transfer to be determined by the number and importance (rela- 
tive frequency of occurrence) of responses common to two tasks. 

This concept is a vast improvement over the omnipresent transfer pre” 
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dicted by such a notion as formal discipline, for the theory of identical ele- 
ments clearly predicts that transfer will differ in amount between two tasks 
according to the variations in the tasks themselves. 

Educational theorists, however, have not always liked the notion of 
identical elements because of its suggestion of an atomistic structure in 
what individuals learn. People learn, they say, not just isolated responses 
Which may work in some new situations and not in others; they learn gen- 
eral principles and rules for applying these principles in a rational way. 
Thus, to use the same example we used to illustrate identical elements, it 
is not merely that a child can apply what he has learned about sums to 
multiplication of n-digit numbers; it is that some general principles about 
the properties of decimal numbers apply to multiplication. It is as if a 
bright child were taught the basic principles of addition and subtraction, 
and from these, himself discovered multiplication and division. 

For a while educational psychologists and learning theorists acted as if 
there were two mutually exclusive notions about transfer—one right and 
One wrong. Common experience, however, suggests that transfer can occur 
by both identical elements common to two tasks and general principles that 
EOvern two or more tasks. The basic psychological problem is not to decide 
Which of these two possible modes of transfer is correct or more valuable, 
but to attempt to uncover the basic rules governing all kinds of transfer. 
This has been the goal of most learning theorists interested in transfer, and, 
in part at least, they have been successful. The most elementary step, of 
course, is a stimulus-response analysis of the transfer problem, and that is 


What we shall examine next. 


The Stimulus-Response Analysis of Transfer 

A stimulus-response analysis of transfer is important because the most 
elementary way two tasks can differ from one another is to differ in some 
simple stimulus or response component. Thus if we train an organism to 
Make some response to a particular stimulus and then fest for this response 
With a slightly different stimulus, We have an opportunity for the evaluation 
Of transfer. We can see that this is the kind of situation in which we measure 
Stimulus generalization; indeed, this generalization turns out to be a special 
Case of transfer of training, perhaps the most elementary. Let us examine 


it as a special case of transfer. 
Stimulus variation with response 

teristic of the measurement of stimulus gener 

response constant and vary the stimulus that ha: 


held constant. The essential charac- 
alization is that we hold some 
s either been conditioned to 
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the response or has been consistently paired with the reinforcement of that 
response. Remember that the major results of the study of stimulus general- 
ization showed that the more similar the test stimulus was to the original 
training stimulus, the greater the potential for eliciting the original response. 
From these studies we should guess that the amount of positive transfer 
would gradually decline as the stimulus situation was changed from what 
it had been during original training (assuming the responses required re- 
main the same). Such a relationship is illustrated in Figure 46. On one axis 
in this figure is the amount of (positive) transfer from 100 per cent to Zero 
and on the other axis is stimulus similarity. Notice that 100 per cent transfer 


100 


PER CENT POSITIVE TRANSFER 


[) 

+ DECREASING STIMULUS SIMILARITY 
(STIMULUS 

IDENTITY) 


Figure 46. Per cent transfer of training between two tasks when the responses are identical and 
the stimuli are varied from being identical to being completely dissimilar. 


is defined as that case in which the test stimulus is exactly the same as the 
one used in original training. 

We may ask, Does the kind of relationship shown in Figure 46 hold for 
more complicated examples than conditioning? That is a difficult question 
to answer, since procedures used in the study of different learned acts are 
likely to be very different. Let us, however, examine some relatively simple 
examples of verbal learning. 

In one well-known experiment (Yum, 1931) subjects learned to asso- 
ciate nonsense syllables with stimulus items. Each syllable was paired with 
a different item, and after the subjects had learned all the associations, they 
could recall the appropriate syllable when they were given the proper stim- 
ulus item. In some conditions the stimuli were other nonsense syllables and 
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mn others they were abstract geometric forms. Let us look at one condition 
in which the stimuli were the latter. The subjects first learned to pair each 
form with a different nonsense syllable (there were 12 in all). Then, 24 
hours later they were presented with the stimuli once more and asked to 
recall the right syllables. In some conditions the same stimuli as those used 
in original training were presented. In other conditions these stimuli were 
altered so that they varied all the way from resembling the original stimuli 
Very closely to resembling them only vaguely. The results of the recall test 


Were as follows: 
Per cent Per cent of recall to 


Stimuli recalled identical stimuli 
Identical .....cceeeeee 84.6 100 
Highly similar ......- 64.5 76 
Similar ...... + 49.2 58 
Less similar ....ceeeee 45.3 53 

36.3 43 


Least similar .....e 
arity of the stimulus items to those used in 
ercent of items recalled decreased. Thus, 
h dissimilar stimulus items than with sim- 


We can see that as the simil 
Original training decreased, the P 
there was less positive transfer wit 


ilar ones. 
Results much like these were also obtained by Gibson (1941), but her 


experiments went one step further. This is important because one could 
argue that the subjects in the experiment just described did not respond to 
the nonidentical forms as frequently as they did to the identical ones simply 


because they did not know that they were supposed to. Gibson, however, 


Studied relearning of the same responses when the stimuli were identical or 
Somewhat varied. The results were much the same; there were greater sav- 


ings in relearning when stimuli were i 
they were only slightly similar or dissimilar. 
pmuli constant. What happens when 


Changing responses and holding sti) 
the same but when the responses are 


the stimuli for two different tasks are when th 
different? There is an old rule of thumb that says this situation always pro- 


duces negative transfer. This implies that the second task is harder to learn 


than it would have been if there had been no first task. The further impli- 
n task one interfere with the acquisition 


Cation is that the responses learned i CHINN ACY 
Of new responses to the same stimuli. We shall deal with this implication at 
l টী 
ngth in the next chapter. 

For now, we should note that t 
of what really happens in practice. 


his rule is a considerable simplification 
Actually, it sometimes happens that 


ত 
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there is positive transfer when responses are changed and the stimuli are 
the same. Let us look at some examples. 

In one experiment (Kellogg and Walker, 1938) dogs were conditioned to 
flex their right rear legs to a buzzer by pairing the buzzer with a shock to that 
leg. The shock was then transferred to the left rear leg. The animals learned 
this response much faster than they did the original right-leg response. 
Further exploration showed that it was easier to recondition the right leg 
than to condition the left leg after original conditioning. Furthermore, it 
turns out that there was a rough gradient among the four feet, so that after 
the conditioning of the right rear leg, the left rear leg was easiest to con- 
dition, then the right front leg, and finally the left front leg (Kellogg and 
Wolf, 1940). 

In this situation the transfer is evidently a matter of components of the 
responses in original training being common to the transfer situation. In the 
initial conditioning of the right rear leg the animals sometimes respond with 
the other feet as well. The degree to which the conditioning of the right rear 
leg transfers to the other feet depends upon the probability that the animal 
will respond during original conditioning with one or more of the other feet 
(it should be mentioned that in these experiments the dogs were supported 
in such a way that they did not need to use their feet for support). 

What about the case in which changing the response leads to negative 
transfer? This would happen if a subject learns the association SiR: and 
then finds it more difficult to learn the association S;—Rs» than if he had 
not made the original association. In one of the best-known experiments on 
this problem (Bruce, 1933) the associations consisted of pairs of nonsense 
syllables. Thus the subjects would be required to give a response like KIV 
when given the stimulus REQ. For the second list the subject might be re- 
quired to give a response like ZAM to the same stimulus REQ. Under these 
conditions negative transfer seems to occur. Learning a new response to 
an old stimulus produces interference. 

Not all experiments, however, have found this to be so. For example, 2 
more recent experiment (Porter and Duncan, 1953) which used the same 
design, except that two-syllable adjectives were used instead of nonsense 
syllables, showed no negative transfer when a new list of response items 
was learned to old stimulus items. The lack of negative transfer was partly 
explained by the fact that since the subjects in this experiment were not 
highly practiced at rote verbal learning, simply learning some of the tech- 
niques of this kind of memorization obscured some of the negative trans” 
fer. There is, however, a more fundamental reason why negative transfer 
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did not occur, for in this experiment negative transfer was found with 
slightly different conditions. If the second task consists not of an entirely 
new list of response terms learned to the same stimuli but of the same re- 
sponse terms shuffled so that they are paired with different stimulus items, 
negative transfer occurs. Thus under this condition negative transfer was not 
Swamped by general-practice effect. The reason, of course, was that the 
interference was much greater because the responses in the second task 
Were already connected to other stimuli in the list making up the first task. 

Thus, in general, it appears that if all other things are equal (this usually 
means that there is no practice effect getting in the way), responses which 


are mutually exclusive will interfere with one another when we try to at- 


tach them to the same stimuli. In the experiment we mentioned a few para- 


graphs ago on the conditioning of different legs in dogs, the responses were 


not mutually exclusive or antagonistic, and they did not interfere with one 


another; indeed, since they were all part of common components of reaction 
to electric shock they tended to facilitate one another. 
This difference of positive transfer between responses learned to the 
Same stimuli when they are components of the same act and negative trans- 
fer when the responses are mutually exclusive is sometimes expressed by 
Saying that the kind and amount of transfer depends upon the amount of 
tesponse similarity. Response similarity means that When a response is asso- 
ciated to a given stimulus there are other responses which will tend to be 
elicited by that stimulus. It is assumed that there exists some gradient of 
Tesponse similarity such that, to a given stimulus, some responses have a 
Breater probability of being elicited than others. We can see that the notion 
Of response similarity is roughly analogous to stimulus similarity. | 
The transfer surface. We can put all of the stimulus-response relation- 
Ships in transfer together in one diagram, but the diagram will require 
three dimensions, one for stimulus similarity, one for response similarity, 
and one for amount and direction of transfer. Such a diagram, derived from 
One originally drawn by Osgood (1949), is Shown in Figure 47. The part 
Of the figure in the lower right-hand corner is Where maximum negative 
transfer is found. This is the case in which the stimuli between two tasks 


are the same but the responses required are different. On the other key 
Maximum positive transfer is found in the upper ERE 
Stimuli and responses are identical—in other words when the secon 
1s merely further practice on the first. From this diagram we tha 
further practice on the same task is onlya special and limiting case of positive 


transfer, and perhaps logically it should be. On the opposite side of the dia- 
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gram, which is the side on which stimuli are unrelated, transfer is very 
weak. The limiting case here is the extreme right end of the diagram. Here 
the stimuli between the two tasks are unrelated and there is no similarity 
of responses. Therefore, no transfer, positive or negative, will occur between 
the two tasks. We should not expect learning how to swim to have much 
transfer to performance at the typewriter. 

This diagram summarizes all of the possible stimulus-response relation- 
ships between two tasks. It ought not, however, be taken too literally. For 


RESPONSE SIMILARITY-DESCENDING SCALE 


Figure 47. The hypothetical transfer surface. The amount of transfer (positive and negative) as d 
joint function of stimulus similarity and response similarity. (After Osgood, 1949.) 


one thing—as we shall shortly see—there are apparent exceptions to the 
rules implied by the diagram. For another thing we do not have general 
quantified scales for stimulus and response similarity. Therefore the exact 
shape of the diagram has no meaning; only the general relationships are 
meaningful. If any of the dimensions were expanded or contracted at the 
expense of any others, it would not do violence to the basic notion presented 
in the diagram. In short, it is a qualitative picture of the stimulus-response 
relationships in transfer, and with the exceptions noted below, it works 
fairly well in summarizing data on transfer. 

Transfer situations not summarized by the transfer surface. There are 
many kinds of transfer situations that are not easily summarized by the 
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transfer surface. Among these is learning how to learn, the general improve- 
ment that comes in learning many different tasks all of which demand a 
common technique of learning. In theory, it can be argued, this kind of 
transfer can be considered as a special case of the transfer surface, by 
thinking of the technique of learning (as it occurs, say, in rote verbal mem- 
Orization) as consisting of a series of highly similar responses involving 
mode of attack, etc., which run through all the tasks to be learned. We are 
not always sure what such responses would be, however, and considering 
the general problem of learning how to learn as a special case of response 
Similarity leads more to the aesthetic satisfaction of having one simple prin- 
ciple of transfer than to a real understanding of the mechanisms in learning 
how to learn. 

More seriously, there are real exceptions to the rules implied in the 
transfer surface. For example, Young and Underwood (1954) note that 
Positive transfer does not always occur when stimuli are different and re- 
Sponses are similar to one another. They showed that there will be little 


Telationship between response similarity and amount of positive transfer if 
the responses within the two tasks are not difficult to differentiate. In other 
Words, similarity of response between two tasks will not be an important 


Variable in transfer unless the responses within the tasks are difficult to 
differentiate one from another. Thus the extent to which the transfer sur- 
face applies depends upon the stimulus-response relationships within tasks. 

An even more serious difficulty with an uncritical use of the transfer 
Surface is indicated in an experiment by Deese and Hardman (1954). In 
this experiment subjects learned to complete sentences by the method of 
Paired associates. Thus a statement like “the path is” would be given as a 
Stimulus item, and the subjects would learn to complete the statemegf by 
Living the correct word “rocky.” A series of such statements were Biven as 
the original task. In the second task, the same stimulus items were given 
but the subjects had to learn new response ern one ণ্ড the new re- 
Sponse terms were synonyms (thus in the case of “the path is,” the new re- 
Sponse would be “stony”); in the other case the new response terms were 


Unrelated to the old ones (thus to the example Elven, the new response 
Would be “short”). Looking at the transfer surface in Figure 47 we should 
Probably expect the synonyms to yield positive transfer and the unrelated 
Words to yield negative transfer. And that is exactly What happened in the 

od. This experiment went one step 


earnin k. So far SO £0 Eo & 
8 of the second tas sked to relearn the original list. In this 
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negative transfer was greater after learning the synonyms than after learning 
the unrelated words. Thus, in this experiment, positive transfer was found 
in proaction with synonyms and negative transfer in proaction with un- 
related words. In retroaction, however, both cases produced negative trans- 
fer, greater in the case of the synonyms, in apparent contradiction to the 
transfer surface. 

The explanation for this finding is that the synonyms in the second task 
were easier to learn and hence better learned in a fixed number of trials 
than the unrelated words. Thus they competed more with the retention of 
the original items than did the relatively weaker unrelated words. This is a 
reasonable explanation that does not violate the spirit of the transfer Ssur- 
face, but it should make us cautious in making a blanket application of the 
relations in the transfer surface. In general it is less easy to go astray in 
applying the transfer surface to proaction than to retroaction experi- 
ments. 

Identical elements and common principles in the transfer surface. The 
transfer surface is a lineal descendant of the theory of identical elements in 
transfer. This is because most of the experimental studies of transfer have 
emphasized the analytic aspect of the problem, and the relationships be- 
tween tasks have been dissected into their stimulus and response compo- 
nents. The similarity relationships that have been studied can easily be con- 
sidered in terms of common elements. Therefore the transfer surface can be 
taken as a kind of detailed account of the transfer one would expect with 
identical elements in two tasks. 

What about transfer by general principles? It is not difficult to stretch 
the same stimulus and response analysis to this case. One well-known study 
of transfer in a schoolroom task illustrates this. Knight (1924) had two 
groups of children practice the addition of fractions. One group practiced 
on fractions with denominators ranging from 2 through 30. Another group 
practiced with a limited nuniber of denominators, 2, 4, 6, 8, 12, and 24. 
A test was given on those denominators practiced by the first group but not 
by the second. Both groups did just about equally well on the test. In other 
words transfer was perfect. It is easy to see that this example, which fits 
the description of transfer by general principle, also is in accord with the 
transfer surface. In this case some component, a general rule, was common 
to the two tasks. 

Wherever possible we shall try to refer examples of transfer of trainings 
both in this chapter and succeeding chapters, to the transfer surface. While 
some experimental situations do not lend themselves easily to a simple 
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stimulus-response analysis, the general usefulness of the transfer surface 


is considerable. 
SPECIAL PROBLEMS IN TRANSFER 


Learning How to Learn 
t general and important kinds of positive transfer is 
learning how to learn. Every investigator of verbal learning knows that sub- 
jects who have never learned by rote association do relatively poorly in 
laboratory studies of paired-associate or serial-anticipation learning. Such 
Subjects will take a large number of trials to learn their first list of, say, 
nonsense syllables. They generally show great improvement on the next 
list, however, and on each succeeding one they show additional improve- 
ment. After 15 or 20 lists, however, the amount of improvement from list 
to list is relatively small; the subjects have learned just about all they can 
about the techniques of learning nonsense syllables by rote. 
B Learning how to learn is a very general phenomenon. So far as we know, 
it occurs in some degree for all techniques of learning and for all kinds of 
tasks. One of the central problems in the transfer of training is to find out 
how and why such improvement occurs, 28 well as to see what the implica- 
tions are for more general problems in the psychology of learning. 
Learning sets. In the chapter on discrimination learning we examined 
One such attempt to understand the mechanisms in learning how to learn. 
This particular problem is the formation of learning sets and is specifically 
Concerned with the extent to which improvement occurs in successive dis- 


Simin: 
Timination problems. 

ৰ We have seen in our discussion of 

i টী 

Sms are given successive problems, t 


One of the mos 


discrimination learning that as organ- 
heir ability to solve each problem im- 


Proves. Thus it may take a monkey many trials to learn that a raisin is to 
be found under a red box instead of a blue cylinder if this is the first dis- 


crimination test it has ever worked at. After a number o£ such tests, how- 
ever, the monkey needs only one trial to show perfect discrimination; 

An analysis of the learning-set problem showed that it was the result of 
reduction of stimulus generalization (and hence negative transfer, since 
each problem requires a different stimulus-response connection) and the 
Sradual emergence of a new habit, a habit of testing stimulus objects irre- 
Spective of every dimension except position (left or right) for their corielaz 
tion with reinforcement. The implication is that through the reduction of 

w habit is allowed to 


Stimulus generalization and negative transfer a ne 
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emerge which transcends the particular problem the animal is working on. 
The animal learns to look for the significant variation in stimuli, and his 
solution is likely to appear to be insightful. 

The development of a learning set can be characterized as the emergence 
of a higher-order habit composed of the single common component rein- 
forced in a number of lower-order habits. Each discrimination problem can 
be looked upon as a lower-order habit which must be mastered. The only 
behavioral feature that is consistently reinforced from problem to problem 
is the habit of making a choice and changing the choice when the initial 
one is incorrect. The habit of choosing blue stimuli because of correlation 
with reward in one problem or square stimuli for another problem will 
only be reinforced part of the time. The only aspect of the set of discrimina- 
tions consistently reinforced is that of reversing the choice of stimuli when 
the initial one is incorrect and keeping with the initial choice when it is 
correct. 

Do learning sets occur outside discrimination-learning problems? As we 
shall see later, there is an analogy between learning abstract concepts and 
learning sets. Furthermore, probably many examples of learning how to 
learn are the result of emergence of higher-order habits. Moreover, even 
the relatively simple examples of learning how to learn illustrated by dis- 
crimination learning in monkeys involve such diverse factors as learning 
not to perseverate to the same stimulus from trial to trial and learning to 
respond to differential aspects of the cues (Harlow, 1950b). 

Teaching how to learn. We have seen that there is a general improve- 
ment in ability to learn that comes with practice at a particular kind of 
learning. There is more to it than this, however. Many kinds of remedial 
educational programs are built on this assumption, plus the supposition that 
special methods of teaching how to learn will produce even greater im- 
provement. 

In a classical experimental study Woodrow (1927) showed that specific 
instruction in how to memorize poetry, prose, and factual material resulted 
in greater improvement in ability to memorize than undirected practice. 
Woodrow stressed many of the techniques that are now standard in teach- 
ing students how to learn. He used active recitation, attention to meaning 
techniques for maintaining continuous alertness, etc. Woodrow’s study was 
not analytical, and we cannot tell which of the factors was most important 
in improving memory. The important point, however, is that training for 
memory does have beneficial effects over and above those that come from 
practice alone. 


— 
সক 
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Differentiation and T' ransfer 

In the section devoted to stimulus-response analysis and theory of trans- 
fer we pointed out that one important interpretation of stimulus and re- 
Sponse relationships was that transfer was to a considerable extent mediated 
by differentiation. This was the view of Gibson (1941), and it has received 
Considerable experimental attention (Young and Underwood, 1954; Under- 
Wood, 1954a). One of the implications of this notion is that if we could 
induce differentiation among either the stimuli in a task or the responses 
Without specific practice on the task or on the responses in il, we could 
Produce positive transfer. 

A simple example of stimulus predifferentiation is illustrated in an ex- 
Periment by Gagne and Baker (1950). These investigators gave subjects 
the task of moving a switch when a particular light came on. There were 
2 number of lights located in different positions, and each light signaled a 


separate switch. Some of the subjects were given preliminary practice, not 
in merely assigning letters of the alphabet to 


nN moving the switches, but : f 
associating the lights with names had an 


each of the lights. This practice in i { h 
enormous influence on learning to associate the lights with the switches, 


even though the switches had nothing to do with the letter names. Those 
Subjects that had no stimulus-predifferentiation training made about 7 
Mistakes in their first 20 trials on the switch task. Those who had 32 stim- 
s made only about 3 errors. f ‘N 
interpreted to mean that stimulus Predificrentiation 
teduces the amount of stimulus generalization between the Various stimuli, 
$0 that in the final task, when these stimuli are associated with motor re- 
Sponses, they are more distinctively different for the subjects. J 

There have been many similar experiments demonstrating essentially the 
Same thing. Goss (1953), however, has pointed out that it is not entirely 
Certain that the obtained transfer in these studies has been due to stimulus 


Predifferentiati has further analyzed this problem and shown 
FO NOnACT DT deed the main factor in the trans- 


that whi st i ation is in 

ile stimulus predifferentiation : y 
CT effect, other a such as Warm-Up, also contributed. In general, any 
king of labeling or differentiation of stimuli will help Hn ৰ subsequent task 
that demands different responses be attached to these stimull. 


isiti f a motor skill is common, 
Tan: ins to the acquisition 0 : EI k 
OLE because of the differentiation which 


and it very li incipally 

y likely occurs princip : : 
Preliminary bal training allows. Battig (1956) reminds us that the 
Carning of motor skills involves both discovery and performance. Discovery 


WusJetter naming trial 
This experiment was 
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means learning to differentiate between stimuli, to pick out their critical 
aspects, and to learn which ones are to be reacted to in a different manner. 
Performance means the actual motor responses which must be made. In 
many motor skills preliminary verbal training will suffice for learning the 
stimulus relationships. Battig showed that. the amount of transfer from 
verbal pretraining to a motor skill varied inversely with the complexity of 
the motor activity required. When the motor, or response, side of the skill 
is relatively unimportant, verbal pretraining is of great value. If the motor 
aspect is too complicated, there is little or no transfer to the motor skill. It 
these results are generally true for many different tasks, we may have an 
important principle for the application of verbal pretraining to learning 
skilled performance; the value of such training decreases as the motor com- 
plexity of the task increases. 


Transfer and Difficulty 


There are some rather surprising and complicated relationships between 
transfer and the relative difficulty of the tasks involved. In some cases trans- 
fer from an easy task to a hard one is greater, and sometimes the converse 
is true. Let us examine the latter case first. 

An experiment by Jones and Bilodeau (1952) illustrates the situation 
in which transfer is greater from a hard task to an easy one. Both tasks 
these investigators used involved coordinating the two hands to keep an 
indicator on a moving target. The difference between the two tasks was in 
the complexity of the movement of the target. In the hard task, the target 
moved in a very complicated path; in the easy one it moved in a relatively 
simple path. With the use of several groups of subjects, the investigators 
studied the transfer from the hard path to the easy one and vice versd- 
There was very great transfer from the complicated path to the easy one; 
as a matter of fact the transfer in this case was equivalent to direct prac" 
tice on the easy task. There was also positive transfer from the easy task 
to the complicated one, but not nearly as much. 

In the particular situation studied by these investigators, the harder task 
involved all of the components of the easy task and permitted greater 
practice. The easy task had only some of the same response components as 
the difficult task. Therefore the transfer, while positive, was somewhat Jess: 
In general, this will be true; in tasks in which more difficult versions con 
tain all of the stimulus-response elements in simpler versions plus additiona 
ones, positive transfer may be large, indeed, as large as the effects of direct 
practice in instances where no negative transfer from particular stimulus” 
response elements occurs. 
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In other situations, however, it seems that preliminary practice on easy 
tasks produces greater transfer than preliminary practice on difficult ma- 
terial. This was the case in a study of discrimination learning in rats (Law- 
rence, 1952). The rats were trained to discriminate between the brightness 
levels of two goal boxes. In one condition the animals received all of their 
training on a difficult problem, that is to say, a problem in which the bright- 
ness levels were nearly the same. In other conditions the animals received 


preliminary training on problems in which the brightness levels of the two 


boxes were quite different, so that discrimination was easy. All animals 


Were eventually tested on the difficult discrimination. 

The results, rather surprisingly, were that all the animals trained on 
easier problems did better than the animals trained on the difficult problem. 
This result is probably the result of factors much like those responsible for 
the stimulus-predifferentiation effect. Training on the easier discrimination 
allowed the animals to discover more readily the critical difference be- 
tween the two boxes and pay attention to it and to no other stimulus con- 
ditions. Thus, when these animals were transferred to the more difficult 
discrimination they were alerted to brightness differences and found it easy 
to solve the problem. Animals trained all along on the difficult discrimina- 
tion, however, found it difficult to solve the problem because the small 
brightness difference between the two boxes was never brought directly to 
their attention. 

Thus in cases in which difficult problems involve stimulus-response com- 
Ponents identical to those in easier tasks (but also additional stimulus- 
response components) there will be more transfer from the difficult prob- 
lem than from the easier one. In those cases in which the problems involve 
the isolation of relevant stimulus dimensions and the association of certain 
Stimuli with reward or solution of a problem, training on easy problems will 
transfer more than training on difficult problems 

These two different cases have not been completely explored, but it is 
likely that the principles in them can be generalized to many different tasks. 

f so, we have some important rules about the application of transfer of 


training to problems of varying difficulty. 


POSITIVE TRANSFER IN SCHOOL LEARNING 


of training has been one of the prob- 
1 psychologists. We shall not look at 
cations of transfer principles to 
ome of the more general ques- 
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tions that have come from the study of transfer in school. One of the most 
important of these is training for transfer. 


Training for Transfer 


One of the basic problems in teaching is to maximize the amount of 
transfer. We have seen that transfer from one task to another may not be 
very large; consequently educational psychologists have looked for ways 
to increase the amount of transfer. Mostly this has meant that psychol- 
ogists working in the school situation have tried out various methods for 
making transfer greater between academic subjects, or an academic situa- 
tion, and some practical problem. An example will show the sort of things 
educators have been concerned about in transfer. 

An example of training for transfer. It is generally accepted that a for- 
mal academic discipline like high school geometry provides some transfer 
to reflective thinking in general. In some circles this is regarded as a hold- 
over from the notion of formal discipline, but, of course, such transfer does 
occur. The postulatiorial technique demonstrated in EOE does general- 
ize to other things, and students are sometimes able to see this. The late 
Clark Hull, as a matter of fact, said in his autobiography that his interest 
in scientific theory generally came from his high school days and the study 
of geometry. 

In the main, however, transfer from a traditional geometry course to 
other kinds of situations involving reasoning is disappointingly small. Ulmer 
ld) Performed an experiment in an effort to see what could be done to 
SD tram oy transfer by designing a special course in high school 

Ulmer used three large Sroups of high school students in his study. One 
group served as a control. A second group of students was taught ge- 
ometry by the traditional methods. A third group was taught geometry 
through a program designed to arouse critical thinking both connected an 


ENCONnENET with geometry. The teachers for the students in this group 
used a special manual designed to extra 
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impressive evidence for the possibilities of training for transfer, but it does 
not tell us about the nature of training for transfer. We cannot know, from 
the available evidence, the complete answer to this question, but we do 
know enough to be able to see some of the more important factors. 

One paramount factor is the control of generalization. We saw in studies 
Of animal learning and human rote verbal learning that generalization is an 
essential element in both positive and negative transfer. We also saw in the 
experiments on the formation of learning sets that experience at multiple- 
task learning gives learners direct control over generalization. That is to 
Say, organisms learn to vary generalization voluntarily. Variation in gen- 
eralization is learned as a critical element in problem-solving behavior, and 
the sophisticated learner (be it rat or man) varies the extent to which be- 
havior is generalized in accordance with other things learned about the 
testing situation. Thus the monkeys in the learning-set experiments learned 
to test specific hypotheses about which stimuli would lead to reward, rather 
than to rely simply on the strongest generalization of the moment. 

This is extremely important, and we shall discuss it again in connection 
With problem solving. It should be clear in the present context, however, 
that broad generalization is essential for transfer. Unfortunately transfer is 
both positive and negative, depending on whether the generalized behavior 
is compatible with the requirements of a new situation or not. A higher- 
Order behavior is achieved when the organism can decide whether or not 
to Vary transfer based on its estimates of the relative probabilities and 
advantages of negative and positive transfer. * rf 

There are numerous applications of particular techniques of training for 
transfer, Many can be found in basic textbooks in educational Psychology 
(for example, see Frandsen, 1957). Such matters as unifying concepts in 
arithmetic and phonetics in spelling are all important NESTON of transfer 
In the elementary curriculum. Indeed, a good deal of the dispute over the 


Use of the so-called “phonic” method in reading is basically concerned with 
Kk R gative Or mostly positive transfer 
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Consider some of the erroneous defin 


g in 
itions given by university students i 
a vocabulary test (Johns, 1939): 


Elucidation 


To get away from (“elude”) 
Elimination (from the sound) 

Coerce 
Speak, discuss (possibly from “converse”) 
Plan of study (from “course”) 

Innate 
Without life, dead (from “inanimate”) 
Silly (“inane”) 


Obviously, some training in the fu 
vide enough control over 
tive transfer would result. 
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rules that they had learned. As we might suppose, those children higher in 
mental ability profited more from the rationalization procedure. 

The relationship between transfer and mental maturity plays a consider- 
able role in an important educational concept, the concept of readiness. 
Educational psychologists have made much of the futility and, indeed, dan- 
ger inherent in trying to teach children skills before they are ready for 
them. In general this means that one should not try to enforce practice at 
a Skill until certain basic functions have matured to a certain level. It is im- 
Possible to teach most preschool children to write, because they have not 
yet the command over the fine muscular movements that are required in 
Writing. Such children can only make rather large and clumsy movements 
with their fingers and arms. It is to be noted, however, that a considerable 
(though indeterminate) part of the maturity required for the ability to 
Write comes from previous training. Scribbling, drawing, working with the 
Paint brush, and indeed, a whole host of related activities probably all 
give the child an ability to work with the muscles of his hands and arms 


that later transfers to writing. 


Transfer and Specialized Training Programs 

The value of general education can exist quite apart from any specific 
transfer value assigned to parts of it. This is not true of most specialized 
training programs, however. If the Army or the Air Force invests time and 
money in training a man for a particular job, it wants to know whether this 
training has actually made him better for the job than he was before train- 
ing. In other words, in specialized training programs transfer is a critical 
element. ft 

In many applied problems in transfer the important problem is that of 
validity. We need to ask the question, Does training on a special device or 
by special techniques show positive transfer to the actual task? There are 
many questions subsidiary to this main question. For example, on-the-job 
training is usually very expensive. Will a preliminary training program not 
Only transfer, but will it transfer enough to the actual task so that the 
training program is, in the long run, cheaper than on-the-job training? Such 
Problems are of very real concern to the applied psychology of transfer, and 
they have received much attention (Lawrence, 1954; Tufts Institute, 1952). 
It is too early to hope that the basic experimental work on transfer will 
allow us to predict well enough to answer these questions without a direct 
test in the applied situation, but it is not too much to hope that we shall be 
able to make fruitful guesses about which applied training programs will 
profit most from a revamping in view of the basic principles of transfer. 


CHAPTER 10 


RETENTION AND FORGETTING 


ine 
yj examin 
£ at the problems of learning. We can rdinarily 
hanges as a function of Et of prac 
rovement in performance as the resu 


F tho 
an intervening time interval. This is ec, 
hapter. Both of these approaches are, 0 1 there 
he same problem. If there were no ET Ihe 
nce the gradual increase, or improveme. ractice- 
sed upon residual benefits from previous i eS 
ctors underlying retention are the El srgetting 
Conversely, that the factors underlying fo If this 
ch tend to slow down the learning process. 

Of retention? ine 
estion is that such a study allows us to NL 1 
contaminated by further practice or expos 


2 at 
n whs 
n we have determined to what extent ny mental 
Ss us ho 
Manner something is retained, we have a measure i SE exposure 
Processes reorganize material already learned withou ization we can learn 
to the material. From a study of this process ৰ HERBAL 
টু ves. 
Something about the mental Processes themse need to define the two 
Before we begin an examination of retention, oA extent to which ma- 
Words retention and forgetting. Retention refers to the 


ion 
{ ing refers to the portio! 
terial originally learned is still retained, and forgetting refer B 
lost. Therefore 


Wwe shall consider in this [ul 


could be no learning, si 
Occurs with practice is ba: 
This suggests that the fa 
underlying learning and, 
are the same as those Whi 
is so, why a Special study 

The answer to this qu 
changes in behavior un 
learning materials. Whe: 


ned — amount retained 
= Our lea: 

orgotten = am 

Amount fl iE 


Ee om Hi BEAM aan HN 
BHAA a aaa 


RETENTION AND FORGETTING 237 


Amount retained is always measured directly, and amount forgotten is 0b- 
tained by subtraction. Since amount retained is the direct measure, it is 
the measure we prefer to use. It is useful, however, to talk about forgetting 
When we are considering those conditions which tend to reduce the amount 
retained, and this chapter is therefore separated into two sections—one 
called Retention and one called Mechanisms of Forgetting. In the second 
section, emphasis will be on forgetting as an active process, and we shall 
discuss those experiments in which there is a deliberate attempt to reduce 
the amount of material people will be able to retain. In the section on 
retention we shall examine studies of the amount and the type of things 


people remember after various intervals of time. 


RETENTION 


Retention was one of the first aspects of human learning and verbal be- 
havior to be studied by laboratory methods. Ebbinghaus studied retention as 
Part of a sustained attack on the “higher mental processes.” The results of 
his studies were published in a monograph in 1885 called Uber das Ge- 
diichtnis, or Concerning Memory. This monograph is a landmark in experi- 
mental psychology, not only because it reports important findings but 
because it represents the first experimental study of the problems of human 
verbal behavior. This opened up the whole field of experimental investiga- 
tion of thought, memory, and verbal learning. 

One of the problems which Ebbinghaus studied was how much can be 
remembered after a certain time interval has elapsed since learning. It is 
rather surprising that no one had ever thought to do this before, even in a 
casual way, but the fact is that Ebbinghaus’s results represent the first really 
quantitative data on human memory. Before we look at his findings, how- 
ever, and those of his successors, it will be wise to examine some of the 


methodological problems in the study of retention. 


The Measurement of Retention 


Experimental Designs 

Experimental methods in the study of retention fall into two classes. In 
one class the same subjects are tested for retention of the same material 
several times. An experimental study of this sort yields very impure meas- 
ures of retention after the first test, since all subsequent measures are con- 
taminated by the practice the first test allows. As a practical matter, how- 
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ever, many studies of retention of school subjects have been carried out in 
this way. 

‘A more adequate approach allows only one test of retention after learn- 
ing. This can be accomplished by (1) repeated measurements on the same 
subjects, using different materials for each test, and (2) independent meas- 
urements of different subjects, using the same material. In the first method, 
the subjects may learn one set of nonsense syllables and be tested for re- 
tention after an hour, then learn another set of nonsense syllables of equiva- 
lent difficulty and be tested after two hours, etc. In the second method, the 
subjects are divided into several groups, each of which is tested after vary- 
ing periods of time. 

There are basic and profound differences between these two methods, as 
we shall see later in this chapter. Ebbinghaus and many subsequent investi- 
gators used the first method, but modern investigators prefer the second 
because it eliminates possible interaction between materials learned at 
different times. 

If either the same subjects remembering different material or different 
subjects remembering the same material are to be compared, we must have 
some means of equating learning. This is generally done by using the same 
criterion of learning in all cases. Suppose that we want to find out how many 
nonsense syllables subjects can remember after 24 hours compared with 
the number they remember after 1 hour. We have two groups of subjects, 
both of which learn 15 nonsense syllables. We adopt a fixed criterion. 
Usually this means that we stop practice for each subject after he has 
correctly anticipated all 15 nonsense syllables twice in succession. The sub- 
jects may, of course, be given different amounts of practice before a test 
for retention, but the number of items overtly learned is held constant. 
Sometimes this is undesirable, and we adopt instead some criterion of num- 
ber of trials. Thus we may give all subjects in the two groups, say, 10 trials 
of practice. In this case we hope that, on the average, the individual subjects 
in the two groups will have learned about the same number of items, though 
we actually allow the number of items overtly learned to vary freely. 

In some cases we perform experiments in which there is no clear separa" 
tion between learning and retention. Many experiments on such dynamic 
factors as motivation in learning and retention are of this sort. The simplest 
and most common example is the measurement of the “immediate memory 
span.” The immediate memory span is simply the number (and kind) of 
separate items an individual can remember after only one exposure. Thus 
if we say the following series of numbers to someone, 4-7-2-6-3-8, and ask 
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him to repeat them, we are employing the procedure used in measuring 
the immediate memory span. We shall see the importance of this technique 
from examples given later in the chapter. Its limitation is that it does not 
allow a clear separation of learning and retention. 


Measures of Retention 

The measurement of retention is not so simple as it sounds. There are a 
number of different measures, and they do not always even approximate 
one another. We are justified in calling them all measures of retention in 
that they tend to be correlated, though some are much more sensitive than 
others, and on occasion, may reveal quite different things. 

Direct measures. The simplest kind of retention measurement isa direct 
Measure of performance. In many cases this simply means giving the sub- 
Ject one more learning trial. We may, for example, train a rat to run a maze 


by giving it 1 trial a day for 10 days. We then may “test for retention” by 
later. There may be no difference in the 


in the retention test, however, the 
ime between the next to the last 


fiving one more trial a month 
method of administering the trials; 
emphasis is on the effects of the interval of t 
and the last trial. Thus we may compare retention after 30 days with re- 
tention after 1 day. 

In verbal learning we may use an exactly analogous technique. We may 
aSk subjects to learn a list of nonsense syllables by serial anticipation and 
test retention simply by presenting the list once more. In this test, the sub- 
Jects must anticipate the next item. In verbal learning, however, we may also 
Make use of a somewhat different method of directly testing retention, which 
Minimizes the support the subjects receive from the context of what they 
have learned. This is the method of recall. After we have had subjects prac- 
tice a list of nonsense syllables by serial anticipation, we may ask them 


Simply to recall as many of the items in the list as they can. This eliminates 


Much of the context provided by response-stimulus chaining in serial antici- 


Pation. The result is that such recall scores are generally lower than the 
anticipati 
Cipation or relearning scores. 


B * 
Oth recall and relearning scores m a] টী 
€ number of items correct on the previous trial, which, if the subject 


ES the ultimate criterion, will be all of the items. Thus if we ask a 
Jct to learn a list of 20 syllables, and at recall he is able to recall only 

Ve of them, his recall, or retention score, is 25 per cent. 
sine The direct methods suffer from a lack of sensitivity. It is pos- 
» for example, that a subject may not be able to recall any nonsense 


ay be expressed as a percentage of 
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syllables at a test of retention, and his retention score is zero. Does this 
mean, then, that no effect of the previous learning remains? The answer 1S 
10, since a more sensitive method of probing may reveal some residual 
effects. 

The method of savings is a much more sensitive method for quantitative 
tests of retention. It may reveal retention where none seems to exist by 
straight recall, and, indeed, it may reveal “negative” retention—it can show 
that an individual recalls less than if he had never practiced the material 
being tested. 

As in the direct method, the subject is asked to practice an additional 
number of trials on what he has already practiced. The measure, however, 


is the percentage of time or trials saved compared with the original learning. 
It is given by the formula 


Pex'centisaved = No. trials to learn — No. trials to relearn x 100 
No. trials to learn 


Thus if it took 25 trials to memorize 15 nonsense syllables and 5 trials to 


relearn them after an interval of 1 hour, there remains a savings of 
80 per cent. 


Other methods. 


There are many variations of either the direct or the 
savings method. In addition, there is another important class of measures 
Which can be summarized under the term recognition methods. These are 
very familiar, since one example is the multiple-choice test (invented by 
Ebbinghaus, by the way). This method has greater sensitivity than the direct 
measure, since it relies most fully on context. This is precisely its weakness 
for experimental work, however. It is furthermore dependent upon various 
artifacts of testing. If we simply ask someone whether or not he recognizes 
What we present to him, we run the risk of unreliable and false judgment. 
If we more strictly test an individual by asking him to pick out which of 
Several items is correct, the probability of a correct selection on the part 
of the subject is at the mercy of the particular choices he has to make. We 
can vary the difficulty of an item simply by varying the incorrect alterna- 
tives. If we do this in a controlled manner, we may have a technique that 
is instructive. For most purposes, however, the method of recognition is lim- 
ited to the applied problems of retention of school material and to the study 
of perceptual memory. 

The study of perceptual memory sometimes introduces another tech- 
nique, that of reproduction. This is a variant of the method of recall, in 
which an individual is asked to draw or otherwise reproduce something he 
has previously seen. As we shall see shortly, this method has likewise prove 
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to be very instructive. It is, however, much at the mercy of response factors 
(ability to draw, for example) and variability in the judgments necessary 
to scoring. 


The Nature of Retention 


We can now turn to studies of the course and nature of retention. In 
general, there are two things which investigators have been concerned 
about: (1) quantitative changes in the amount remembered with increasing 
time since learning, and (2) qualitative changes and reorganization of the 
material originally learned. We shall deal with the question of quantitative 


changes in memory first. 


The Course of Retention 

Let us look first at the studies of Ebbinghaus on the course of retention. 
He was much influenced by the notion that the elements of mental activity 
Come in discrete packets called “ideas.” The basic problem in the study of 
mental activity was how dj nt ideas became associated and then disasso- 
ciated. Ebbinghaus wanteW@® study the association of ideas in its simplest 


form, that in which completely new ideas are formed by the subject and the 


Course of their retention examined by the experimenter. Hence the inven- 


tion of the nonsense syllable. 
We do not need to accept Ebbinghau 


Of the association of ideas, but one resu 5 
retention of material in which, at the outset of the experiment, there was 


little or no associative value. In the light of our discussion about the effects 
ot Statistical dependency in material to be learned, we can see that his ma- 
torial was very difficult to learn and remember. 
The results of Ebbinghaus’s measurements of retention are shown in Fig- 
“re 48. The most important feature of this curve is its extremely rapid fall. 
OSt forgetting, or loss of retention, occurs in the first few hours after 
Origina] learning. 
The curve shown in Figure 48 has frequently been characterized as the 
typicay retention curve, which in one sense it is. There are several important 
78s to keep in mind about this curve, however. For one fing it was 0b- 
ied with one highly practiced subject—Ebbinghaus himself. Another 
ant thing to observe about this curve is that it is based on the savings 
Le ek Bt find many examples of the 
EE © older psychological literature, one may HnC bh ah E 
tement, by various methods, of the amount retained over various pe. 


ss devotion to the classical theory 
Jt of his views was that he studied 
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riods of time and for various kinds of learning material. In general all these 
examples agree with Ebbinghaus’s results in one respect— decline in reten- 
tion is negatively accelerated (McGeoch, 1942; McGeoch and Irion, 1952). 
This seems to be true for the measurement of the retention of nonsense 
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syllables by other methods, the retention 
tention of psychomotor skills. Of course, 
Practice reminiscence may occur and lead 
retention. Under well-distributed 
the retention curve seems to be 


of meaningful material, and the re” 
after special conditions of masse 
to a temporary rise in the curve © 
practice, however, negative acceleration © 
the general rule. 

The rate and limit of forgetting is determined by many factors. The greater 


the degree of organization in Material learned, the slower the rate of for 
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getting and the higher the limit of retention. This in part is because of the 
greater redundancy of highly organized material. We know that when an 
individual is required to remember more material within a given context, 
there is lower retention measured by savings (Newman, 1939). Therefore, 
higher redundancy produces better retention principally because of the 
smaller number of things to be remembered. By the same token, memory 
for ideas in a prose passage is better than memory for factual detail (Briggs 
and Reed, 1943). 

Another important determiner of the rate and extent of forgetting is 
amount of practice. It is well established that amount of retention is roughly 
Proportional to amount of original practice (Ebbinghaus, 1885; Krueger, 
1929). This relationship is sometimes expressed by saying that overlearning 
is beneficial to retention. Overlearning means continuing practice after some 
material has already been completely mastered. In general this point is well 
taken; practice beyond the stage at which an individual reaches a criterion 
of 100 per cent correct responses does improve retention, measured either 
by recall or savings. A more thorough study of this problem is needed, 
however. It is unlikely that practice beyond the criterion of one errorless 


trial would indefinitely continue to increase retention and that the relation- 


Ship between practice and overlearning would be the same for all kinds of 


Material, At present, however, we are not able to state clearly the limits of 


this relationship. 


Retention and Speed of Learning 
Most Psychological textbooks, when discussing retention, attempt to 
Correct” a common misconception by pointing out that there is a positive 
relationship between speed of learning and amount retained. That is to say, 
Contrary to folklore, which holds that someone who learns slowly retains 
Well, the experimental literature apparently demonstrates that people who 
Cam slowly retain comparatively little. 
A Paper by Underwood (19546), which considers the problems of meth- 


0 dE I f 
ds of measurement in comparing individual differences in speed of learning 
mple as it first appears. 


Ey BA f 
nd tetention, shows that the question is not sO si 
, Originally collected 


Mderwood analyzed many data on rote serial learning, 
OT other purposes. These, like earlier data, showed that there was nearly a 
Zero Correlation between time to learn and amount recalled; there was a 
le Positive correlation, however, between time to learn and time to re- 
1. Underwood further analyzed the problem by finding a number of 


ite LE | [’ 
ms which were anticipated an equal number of times during learning by 
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fast and slow learners. On the average, of course, fast learners anticipated 
more frequently than slow learners, but it was possible to find some items 
in which the anticipations for fast and slow learners were equal. Even for 
these items, however, retention was better for fast learners. 

This means, in effect, that any given correct anticipation strengthens an 
association more for a fast learner than for a slow learner. If this is So, de- 
gree of learning of the material is not equal before introduction of the re- 
tention interval. With this in mind, Underwood adjusted the association 
strength of items for fast and slow learners. When this Wwas done, there was 
10 difference in retention between the two groups. 

The net effect of this analysis is to show that very likely the critical dif- 
ference between fast and slow learners is simply the associative strength of 
particular items at the end of learning; fast learners have acquired more 
associative strength for particular items. It does not mean that fast learners 
have an inherent ability to remember things; it simply means that in a given 
amount of practice they generally acquire Sreater associative strength for 


the things they are learning. Perhaps they have more efficient means of 
coding (see below). 


Reorganization in Retention 


There is a good deal more to the Problem of retention than simply 
whether or not responses are retained; they are reorganized in accordance 
with the individual’s previous habits, motives, and other unique character- 


istics. Such reorganization is perhaps the most fundamental 
tention and we shall want to 


and subsequent sections. 
Clustering. 

ganization of ve 

tering discovere 


aspect of re- 
Pay considerable attention to it, both in this 


One of the clearest and most important examples of reor- 
Tbal material in memory occurs in the phenomenon of clus- 
d by Bousfield and his associates. Clustering can be demon- 
strated by the following experiment (Bousfield, 1953): Subjects listen to 
a list of 60 words. These fall into four categories—animals, names, pro- 
fessions, and vegetables. There are 15 Words in each category. The 60 
words are presented in random Order, so that words from the same category 
occur together only by chance. Immediately afterwards the subjects write 
down, in order, all of the words they are able to recall. The two important 
results of this experiment were: (1) the subjects recalled more words than 
they would with lists of random words, and (2) they tended to recall words 
in the same category together. 


As we have seen, in free recall of unorganized material subjects tend to 
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recall the last items of a list first and the remaining items in order of de- 
creasing associative strength (Deese and Kaufman, 1957). When items can 
be categorized by the subjects, however, they reorganize the list so that they 
tend to give similar items together. 

Bousfield points out that one of the principal determiners of whether or 
not a subject will recall a particular word in a list is the subject’s previous 
experience with it. If it is a word that has occurred frequently in the sub- 
Ject’s past it is more likely to be recalled than if it has occurred very infre- 
quently.1 This simple law of habit strength cannot account for clustering, 
however. Bousfield and Cohen (1953) have emphasized the importance 
Of a second determiner of clustering. This they call a relatedness increment. 
They believe that words related to one another facilitate one another in 
Tecall by raising in strength a superordinate category of words to which ali 
the Words in a given category belong. The superordinate category is ac- 
tivated by a single high-strength word from the list; the category in turn 
then arouses the low-strength words in the list. For example, the occurrence 
Of the word “Jeopard” activates the mediated response “animal,” which in 
turn arouses the other subordinate members of this class (Bousfield and 
Cohen, 1955). 

h The important point about this notion is that the mechanism of clustering 
1S not a direct matter of association. Rather it seems to be the result of an 
‘ntervening mediational mechanism which is aroused by words on the list. 
Such a mechanism not only codes items presented to the subject in such a 
Way that he can more easily recall them, but it reorganizes the order in 


Which i PO? 

hich items are emitted in recall. 

{ Number of categories in recall. 
Oncerns the number of categories into which the items on a test of recall 


May be classified. If one holds the length of the list constant and varies the 
of items in a category decreases. What 


An important problem in clustering 


nu Hl 
h Mber of categories, the number 
appens to recall under variation in the number of categories? One might 


€ . 2. 
Xpect that recall would decrease as the number of categories increased be- 
Cc B f 
use too large a number of categories might exceed the immediate memory 


Span (which is about seven items for adults). On the other hand, if the 
umber of categories is reduced too far, there might be too many items in 
ven taking into account 


an; K 
y one Category for the immediate memory Span, € 


y of experience and habit or association 
is very likely negatively accelerated. That is to say, a word that is extremely 
is not much stronger than one that is only moderately familiar, but a mod- 
amiliar word is very much stronger t 


1Th } 
strength relationship between frequenc 


Amiliar 


era 5 
ately 1. han an unfamiliar word. 
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the advantages of coding. Perhaps one would expect some optimum num- 
ber of categories for a list of a given length. | 

Actually, there has been little experimental work on this point. bee: 
(1954) has studied frequency of recall for lists 24 items in length compose 
of two, three, and six categories, which in this case were composed of Hames 
of artists, athletes, musicians, etc. The experiment showed that recall As 
creased slightly as the number of categories increased from two to SiX. 
This is perhaps what one would expect since six categories are not too great 
a burden on the immediate memory span (assuming that the subject could 
remember one name from each category). The names themselves, however, 
perhaps have low associative strength for any particular mediating superor- 
dinate, with the result that as the number of names gets too large and the 
number of categories gets too small, the probability of arousing a good 
Superordinate is reduced. 

The net result of all this is to suggest that we should be extremely CHU 
tious in generalizing the results on this point from any one limited experi- 
ment. If we knew more about the Ways in which people formed their own 


mediating responses we could predict somewhat more adequately the spe- 
cific effects of number and kinds of cate 


to be no way of finding out about these. 
The immediate memory span and recoding. 


gories, but at present there seems 


We have mentioned im- 
mediate memory span frequently in the last few pages. It is of fundamental 


importance in the understanding of the Organization of retention, and it 
deserves some attention in its own right. 

The immediate memory span is the number of items of a particular kind 
which can be accurately reproduced after a single exposure. The immediate 
memory span for digits turns up as a standard item on nearly every intelli- 
gence test, and the norms for such tests tell us that the average adult is able 
to reproduce six or seven items accurately 50 per cent of the time. The 


immediate memory span is smaller with children and indeed, the fact that it 
regularly increases with age is one of the reasons it 


has a place in intelli 
gence testing. 


The size of the immediate memory span (in number of items correctly 
reproduced) varies with the material used for testing. For example, it 1S 
somewhat smaller for unrelated monosyllabic words than it is for decimal 
digits. Now this fact might lead one to conclude that the number of items 
of a particular kind that an individual can remember after just one hearing 
is determined by the amount of information per item in the testing material. 
Miller (1956b) has explored this hypothesis and found that it does not work. 
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| We can think of the amount of information as the amount of uncertainty 
in any particular item about which we have to guess. Thus, if I ask someone 
ডি guess what word beginning with “A” I am thinking about, his uncertainty 
will be high, because there are so many words beginning with “A.” If I ask 
him to guess what number between 1 and 10 I am thinking about, his uncer- 
tainty will still be high but not quite so high, since this time there are only 
109 alternatives. If I ask him to guess which hand has the coin, the uncer- 
tainty is still less, since there are only two alternatives. 

Information is an important concept in communication systems and com- 
puting devices, and when we say that a given communication system or 
computer has a given informational capacity, we mean roughly that it can 
reduce uncertainty about alternative answers to a problem only at a particu- 
lar rate. 

. If our memory storage system has a limited capacity to take in informa- 
ton, then we should be able to remember more binary digits * than decimal 
tnt more decimal digits than words in a test of immediate memory. 
ee 1s because there is less information in each binary digit than in each 

Cimal digit and there is less information in each decimal digit than in each 
word. 

Bs or a moment ago, however, that things are not that simple. Indeed, 
Ina at ur memory span for binary digits isin general larger than for deci- 
there ES and our memory Span for decimal digits larger than for words, 
eae not nearly enough difference between them to make information 
Hore ASOMSTETE, Indeed, we can remeriber only one or rf the most two 
matey re digits than monosyllabic words. Thus our ability to take in 
NAH and store it momentarily is not determined by the amount of infor- 

in the material. What then does determine the limits of immediate 
Memory? 

ney (1956b) has made a start at answering this question. First of all, 
se we have to accept the notion that what people remember in the 
been Memory span is chunks of things in which information has 
hen f EOOd in various ways. For things like words, decimal digits, and 

SO > ANy discrete series of items, there is not much difference in the num- 
pear] items that can be recalled immediately. Therefore, it looks a5 though 

e remember a constant number of chunks or items irrespective of the 


amo i E i a নত 
unt of information in each item. Miller reminds us that if an individual 
we can guess that he has an 


as an i ্‌ 
An immediate memory span of five words, 
n the 


im i a 
Mediate memory span of 15 or 20 phonemes, since each word, 0 


2 
* Bi ডঃ a? 
nary digits are numbers in which only two digits are used, zero and one. 
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average, will have three or four phonemes. But, says Miller, people do not 
code by phonemes; they code by words, ideas, and sentences. Thus if peo- 
ple can remember an average of two sentences of simple prose, these sen- 
tences represent information coded into five, six, or seven chunks. 

This process is called recoding. We hear something that has many chunks 
(say, words in a sentence) with relatively few pieces of information per 
chunk. We can take this sentence and recode it into fewer chunks (just the 
size to fit the immediate memo 


Ty Span) with more information per chunk. 
Thus we can get the “idea” 


Out of a sentence. We do not know exactly what 
this recoding entails in every instance, but frequently it is probably a way 
Of grouping the pieces of incoming information by applying some familiar 
name to the groups or pieces. 

Miller reports an experimental attempt by one of his associates, Sidney 
Smith, to recode binary digits in an efficient manner. By regrouping the 
binary digits into octal units ® Smith could remember 40 binary numbers 
after a single hearing. How many items out of a sequence like this can 
you remember after one reading: 0101 10100100110101101011011100- 
read through the series you could find a 
iced people would probably try reorganiz- 
uence like this: 010-110-100-100-110, etc. 
S recoding by assigning names to the pos- 
ctice assigning names, you should be able to 


a considerable extent simply 
tion of Items into new Sroups. Finally, when we have reorganized the maz 
terial into a small enough num 

Nonsense materials are harde 


2 T to memorize than sensible verbal material 
simply because the latter com 


mediational process, and w 
come to a discussion of pri 


8 Binary digits coded by assigning numbers 0 thr 


01 
. Thus 1 
would be coded as “5” and 100 as EE ough 7 to groups of three. 
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Perceptual Memory 


Let us now turn briefly to some problems in perceptual memory. Memory 
for visual forms, particularly abstract visual forms like circles, squares, and 
combinations of these, has been the special province of gestalt psychology. 

The gestalt approach to perceptual memory. Gestalt psychologists have 
generally taken the view that there are “fields of organization” in perception 
gnd memory for perceptual events and that these are related to “physiologi- 

weal fields” in the cerebral cortex. They have considered the “memory trace” 
fora Perceptual event to be autonomously altered by events intrinsic to the 
brain. In investigating the implications of these notions, they have studied 
the Perception of figures in great detail and stated some rather simple rules. 
that are Supposed to govern the nature of changes in the memory traces for 
Visual forms. 

In general, gestalt psychologists have supposed that the memory traces 
for forms become better structured over a retention interval. Memories of 
forms are supposed to become simpler than the original perception. Experi- 
Mental studies have not always confirmed the specific predictions various. 
Sestalt psychologists have made about the direction and extent of the 
Changes in forms during a retention interval, and the actual events seem to 

€ More complicated than gestalt notions allow. | 

There is one point which the gestalt psychologists have insisted upon, 
Over the frequently vocal objections of more associationistic Psychologists, 
and in this they have turned out to be absolutely right. This is the general 
Proposition that there are systematic reorganizations going on during the 
Tetention interval. The memory trace for a perceptual event is not just a 
Sradually fading picture of something we have seen in the past; it is some- 
thing that is being actively reorganized and reconstructed. 

It is likely that this reconstruction is much the same process as that we 

Ave just examined in the context of verbal retention. In other words, it is 
i Teorganization that has as its roots certain recoding processes made pos- 
sible by the fact that our perceptual world obeys probabilistic rules and that 
We have aj had experience with these rules. Some of the most interesting 
Studies on the way in which these recoding processes work have come from 

he classical investigations of Bartlett (1932), and we shall now look at 
Some of His nore important conclusions. 

Barner studies. First of all, Bartlett points out, when we are faced 

With ap unfamiliar form, we do not just try to receive an impression of it 
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as it stands. We try to reconstruct it in such a way that it is easy to assimi- 
late into our past experience with the world. There is, in Bartlett's words, 
an effort after meaning. This may be so simple as to be merely giving a 
name to the figure. Unless the figure is very simple and almost completely 
specified by the name, the figure will be schematized in memory. For exam- 
Ple, the form in Figure 49 was frequently labeled by subjects in a percep- 
tual-memory experiment as a “dog’s head” (Deese, 1956). Such a designa- 
tion will surely influence the way in which the form is retained. 

Bartlett gives us several examples of the transformation of visual form in 
memory by the operation of schematic recoding. One of the best-known 
examples is reproduced in Figure 50. In this experiment the original draw- 
ing in the upper left-hand corner was presented to a subject. He was asked 
to reproduce this drawing from memory. The result is reproduction 1. This 
reproduction was then presented to another subject, and this subject was 
asked to reproduce the figure. We can clearly see how the form was trans- 
formed by the various schemata adopted by the different subjects. In gen- 
eral it is changed from the conventional ancient Egyptian symbol for “owl” 
to a picture of a sitting cat. 

Bartlett describes at great length the various ways in which perception 
itself and remembering are transformed by schematization. The schematiza- 
tion partly depends upon the redundancy in visual forms (Attneave, 1954), 
and schematization based upon redundancy is likely to produce transforma- 
tions! in memory in the direction of simple, more symmetrical forms. There- 
fore it seems likely that some of the gestalt laws of perception and perceptual 
memory have as their basis the redundancy that exists in most forms of our 


Figure 49. A “nonsense” form generally recoded by labeling when individuals are asked 1° 
remembEr the form The Tos fegyeifnameosignedhtedtonm byl sro ofc, bronte ved 
head." (Deese, 1956.) SRE 8 RE! 
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Figure 50. A form serially reproduced from memory by different subjects. The form is altered at 
each reproduction. The influence of labeling as a coding device is seen in the change from a con- 
Ventional Egyptian symbol for an owl to a picture of a cat. (After Bartlett, 1932.) 
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experience. Redundancy, of course, means that some of the information pre- 
sented in a visual form is superfluous; thus we can recognize a well-known 
Person from a caricature of a few lines about as well as we can from a 
Photograph. 

« Ways of schematizing. We have already mentioned some of the ways 
In which forms may be schematized for visual memory. They may have 
Names attached to them or be reconstructed by some simple principle of 


if there is high redundancy in them. Actually the ways 


design, Particularly Eo 
nd it will only be worth- 


'n which schematizing may occur are numerous, a 
While to illustrate a few. 


Simple forms may be reproduced even ift A 1 
Much as a list of random digits may be reproduced in the immediate mem- 


Ory span. If nonsense forms become too complicated, however, they must 
be learned by rote, and the length of time this takes is about the same as for 
Any other Kind of nonsense material with an equal number of items (Pitron, 
1920). The determinants of memory for any set of perceptual events, how- 
ever 


hey cannot be well schematized, 


» Are quite complicated. 
For example, if subjects are re 
otms, recoding is likely to be quite 
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em by some isolated segment, characteristic of a form. If rec ) 
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than a simple one, because the former will have a greater probability of 


955) on symmetry. 


quired to recognize rather than reproduce 
different. They may recode by naming, 
they may try to identify 
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having some really distinctive, isolated segment (Deese, 1956). On the 
other hand, if the subjects are required to view too many different forms, 
recognition for simple forms may be better than for complicated forms be- 
cause the number of distinctive segments the subject may have to try to retain 
for the complicated figures exceeds the immediate memory span. 

The importance of the limitation of recoding is illustrated in an experi- 
ment by French (1954). This investigator had subjects identify patterns of 
dots from memory. The subjects were required to learn a name associated 
with each pattern of dots, which were arranged randomly so that recoding 
by familiar names could occur only by chance. The results were that sub- 
jects could more easily recognize patterns with six to eight dots than they 
could patterns with fewer or more dots. Ease of identification was asso- 
ciated with patterns of dots arranged either symmetrically or in linear 
arrays. In other words there was better perceptual organization in the pat- 
terns of six to eight dots. With fewer dots there was less likelihood of organ- 
ization (and hence, of recoding), and with more dots the patterns became 
too cluttered. 

The ability to recode our visual experience, for one thing, appears to be 
a function of the frequency of exposure to certain forms. In one experiment 
(Arnoult, 1956) subjects were exposed to nonsense forms a varying num- 
ber of times. These same subjects were later asked to rate these forms for 
familiarity. There was positive relationship between the frequency of ex- 
posure and familiarity, much as there is with unfamiliar words (Noble, 
1954). Furthermore there were positive relationships between the recog- 
nizability of the forms and their frequency of presentation and rated famil- 
iarity. 


Dynamic Factors in Memory 


If one reads enough books in psychology he will get used to seeing the 
phrase dynamic factors. Generally this refers to individual differences in 
personality, motivation, emotionality, and related factors. Nearly all psy- 
chologists assume that such factors influence memory. Sometimes particu- 
lar theories are quite specific about the ways in which dynamic mechanisms 
may influence memory, as is the case with the Freudian notion of repres- 
sion. More often, however, there is simply general agreement that the spe- 
cific personality mechanisms of an individual will influence things that he 
remembers. 

Individual styles in recoding. Again, we return to Bartlett's studies 
(1932). One of the most influential aspects of these was his observation 


Ee 


RETENTION AND FORGETTING 253 


that the nature and direction of schematizing, or recoding, is determined by 
the personal characteristics of the individual doing the recalling. The ex- 
tent to which personal characteristics determine the nature of schematiz- 
ing, of course, depends upon the material being recalled. In one of Bartlett's 
studies, he asked subjects to indicate their memories for pictures of faces 
by describing the faces after the subjects had seen them. This task appeared 
to be particularly susceptible to personal attitudes and motivation. A par- 
ticular face would be described in terms of the conventional attitudes to- 
Ward the type it represented. Thus, a picture of an enlisted man in the 
army would be described as representing a young, good-humored person, 
etc., and a picture of an officer would be described as stern, elderly, etc. 
Obviously, the attitude aroused by the pictorial schema or the name as- 
signed to the picture by a particular subject influenced his subsequent de- 
scription of the picture. 

We can generalize this kind of observation by saying that the memory 
for a particular event is influenced by the schema the person uses to carry 
it in memory. The schema, in turn, is at the mercy of the individual’s mo- 
tivations and attitudes, some of which are specifically aroused by the Origi- 
nal event to be remembered and some by a label or mediating reaction given 
to the event to be remembered. 

Such observations have been extremely important for certain problems 
in Social psychology, and indeed, Bartlett's studies have been as influential 
in this area as they have been in the psychology of learning. They provide 
ies important point of contact between the principles of social and of indi- 
Vidual behavior. 

Selective forgetting. Retention—or the loss of it, forgetting—seems to 
be selective in nature. That is to say, we remember some things and forget 
Others, not because of the inherent associative strength of the material 
learned, but because they are related to our motivations and attitudes. In 
brief, it has commonly been assumed that people remember those things in 
Which they are interested. 

One of the best-known studies usually cited to support this general prop- 
Osition is by Levine and Murphy (1943). These investigators presented 
Subjects with material dealing with the Soviet Union that was either favor- 
able or unfavorable. Before the study, the subjects had been divided into 
ID groups on the basis of their attitudes toward communism—procommu- 
nist or anticommunist. Levine and Murphy obtained learning curves and 
retention curves for these two groups. It turned out that there was a marked 
lationship between the attitudes of the subjects and amount learned. Thus, 
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the procommunists were able to reproduce, during learning, a larger per- 
centage of the separate ideas from the procommunistic material than from 
the anticommunistic material. Unfortunately, however, because of the dif- 
ference in learning, the retention curves started at different places, and 
hence it is extremely difficult to tell whether there was selective forgetting 
or not. In other words, there was selective learning, but this tended to 
mask the possibilities of finding selective retention. 

Since selective learning seems to happen so much of the time, it is ex- 
tremely difficult to tell whether or not selective forgetting also occurs. One 
experiment at least suggests that it does. In this experiment (Taft, 1954) 
Negro and white boys were asked to recall material read to them aloud; 
the material contained favorable and unfavorable, ambiguous and neutral 
items about Negroes. On an immediate recall test the white boys recalled 
about as many neutral items as did the Negro boys, but the Negro boys re- 
called more emotionally toned items, both favorable and unfavorable, than 
did the whites. The greater recall for the emotionally toned material on the 
part of the Negro boys was probably influenced in part by the fact that the 
experimenter was white and was an authority figure of considerable im- 
portance. 

Thus far this experiment tells us little that is new. However, the experi- 
menter went one step further and asked for a delayed recall three days after 
the original recall. On the delayed recall the Negro youths were still su- 
perior to the whites in the recall of favorable material, but they showed 
considerably more forgetting of the unfavorable items than did the whites, 
so much so, as a matter of fact, that the whites were slightly superior on the 
recall of unfavorable items. 

Why did the Negro youths forget the unfavorable items? Very likely be- 
cause they fitted less well into their emotionally determined schemata about 
successful Negro baseball players (this is what the material was about). 
In addition, however, there may have been deliberate suppression of the 
material (unwillingness to tell the experimenter about the unfavorable ma- 
terial) or repression. This last mechanism is extremely important, and we 
shall deal with it later. 

Once again we must emphasize the importance of the present assimila- 
tion of what the individual perceives with what he has learned in the past. 
This will determine what an individual learns about a new situation and 
probably what he retains of it. Aside from the emotional and attitudinal 
content of this problem it is not basically different from the general ques- 
tion of transfer of training. A new situation arouses a set of well-established 
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and preferred associations in the individual, and it is to these responses 
that parts of the new situation become selectively attached. When the time 
for recall comes, the individual is able to bring out those parts of the new 
situation that can readily be aroused by strong associations—those things 
that have become assimilated into the psychological life history of the 
individual. 


THE MECHANISMS OF FORGETTING 


We have seen that individuals tend to remember things which are related 
in a certain way to what they have learned in the past. In most of the experi- 
mental literature we have described in support of this general proposition, 
the amount of learning was uncontrolled. Thus we have been dealing with 
both learning and retention. If we have established, however, that some 
responses have all been equally well learned, what determines how many, 
and which, of these responses will be forgotten? The answer to this ques- 
tion is the subject of this section. We shall deal with studies in which in- 
Vestigators have deliberately tried to produce forgetting and with the 
theories that have arisen out of these experiments. 

An assumption frequently made is that forgetting is simply a matter of 
lack of practice. A learned act, it is argued, disintegrates because time 
Passes without any practice, much as a footpath will be overgrown with 
Weeds if no one walks along it. This idea is reasonable enough, for in the 
Natural course of events, our learning seems to slip away from us with the 
Passing years. The schoolteacher who returns to the university for a summer 
Session after five years of teaching the sixth grade makes a determined ef- 
fort to impress her French instructor with the fact that she has not looked at 
French in years and consequently will not remember much. 

Arguments against the principle of disuse. Forgetting believed to be a 
tesult of Jack of practice usually is called disuse. There are many grounds for 
dismissing disuse as a serious notion in the psychology of learning. Mc- 
Geoch (1932) pointed out some of these in a classical paper on the nature 
of forgetting. The most important point that he made is that disuse does not 
explain forgetting if disuse only implies the passage of time, for time in 
itself does not cause anything. Events happen in time; certain conditions 
Change over time, and it is these that provide explanations. Thus we find 
that forgetting is not determined by passage of time alone, but by the na- 
ture of the events which fill a time interval. An analysis of these events will 
lead to an understanding of the nature of the forgetting process itself. 
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Transfer and Forgetting 


If we once admit the proposition that forgetting is determined by events 
filling a time interval, it immediately becomes apparent that the experi- 
mental designs for the study of transfer of training are also those for the pro- 
duction of forgetting. Negative transfer, then, is admitted to be one of the 
mechanisms of forgetting. 


Retroactive Inhibition and Proactive Inhibition 


Experimental designs. As we saw in the chapter on transfer there are 
two types of basic experimental designs for the study of transfer. The retro- 
action experiment measures the effect of an interpolated activity on the re- 
tention of some previously learned act; the proaction experiment measures 
the effect of an older learned act upon the learning and retention of a new 
act. The basic designs are as follows: 


Proaction 


Experimental group 


LEADS HABE aim aca s JEATIBTASEZ ou am x 500 Gan (recalls task 2) 
Control group 
REL 1° POPE CRETE learns task Doers sa. (recalls task 2) 


The influence of task 1 upon task 2 can be seen by examining either the 
learning of task 2 or the retention of that task sometime after its Original 
learning. 


Retroaction 


Experimental group 

LOaENStASK La aimee JearnS LAS DE 3.60 ws Haas tested on task 1 
Control group 

TEATS: EASE J, atts a, HESS 2 a eieiase ent i CAA tested on task 1 
The influence of task 2 is measured by a retention test (usually recall or 
relearning) of task 1. 

If the experimental condition impairs the retention of an old task (retro- 
action) or the learning or retention of a new task (proaction), negative 
transfer or inhibition has taken place. Hence the special names retroac- 
tive inhibition and proactive inhibition for the negative side of transfer. 
Most investigators have considered retroactive inhibition to be the more 
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important condition for the mechanisms of transfer in forgetting, but more 
recently proactive inhibition has been assigned an important role also. 

In order to see how the conditions of negative transfer have been applied 
to a theoretical analysis of forgetting, it will be necessary for us to examine 
Some of the principal experimental findings in negative transfer. In the 
next few pages we shall look at some of the more basic experimental vari- 
ables. 

Negative transfer and stimulus-response relationships. In the chapter 
On transfer we examined a diagram, the transfer surface, which implied 
that the major conditions for negative transfer between two tasks were (1) 
common stimulus elements between the tasks and (2) different response 
elements. The diagram implied that, holding stimulus similarity between the 
tasks constant, transfer could be varied from positive to negative by chang- 
ing the responses in the two tasks from being identical or very similar, to 
being very different from each other. 

This diagram was drawn by Osgood (1949) to summarize the findings 
of many different experiments. In other words, the basic relationships pre- 
Sented in the diagram are based upon empirical research. A large number 
of experiments, দি both the proaction and the retroaction design, have 
Shown that when stimuli are the same, but responses different, negative 
transfer results (Bruce, 1933; Gibson, 1940; Bugelski, 1942; Underwood, 
1945; Osgood, 1946; Morgan and Underwood, 1950). 

Thus, a safe generalization about paired-associate learning is that when 
the same stimulus items and different response items are used in two tasks 
there will be negative transfer. 

The transfer surface also implies that the direction and amount of transfer 
depend upon the degree of response similarity. As this similarity goes from 
8 maximum (identity) to a minimum (as Osgood suggested, opposite 
Words), transfer goes from maximum positive to maximum negative trans- 
fer. The empirical evidence, however, does not entirely support this rela- 
tionship between response similarity and transfer. 

f Some experiments appear to uphold the notion that transfer varies con- 
tinuously as a function of response similarity (Morgan and Underwood, 
1950). Other experiments cast doubt on the generality of this relationship. 
For example, Bugelski and Cadwallader (1956) performed an experiment 
designed to test all of the implications of the transfer surface. In this ex- 
Periment the greatest amount of retroactive inhibition was for responses in 
recall which were somewhat similar to responses in interpolated learning, 
Tather than for responses which were neutral or opposite. 


258 THE PSYCHOLOGY OF LEARNING 


This is a complicated and difficult issue to settle. It may be partially the 
result of the amount of interpolated learning, as it was in an experiment 
(Deese and Hardman, 1954) cited in the chapter on transfer. On the other 
hand, it may be the result of the way in which the subjects are able to re- 
code responses. For example, if during interpolated learning, the subjects 
become aware that the responses required are all antonyms of the responses 
required in original learning, the net effect, both on interpolated learning 
and recall of the original list, may be positive transfer. At recall the subject 
remembers that one of the words in the interpolated list was “swift” and 
quickly thinks of the antonym “slow.” If, however, the words on the two 
lists are unrelated to each other, no simple recoding will do. Perhaps maxi- 
mum negative transfer would be obtained in situations in which responses 
are simply unrelated or are so similar that the subject cannot remember 
which word belongs to which list. Suppose, for example, that a subject 
remembers that “swift” was a response item for one task and “fast,” for the 
other; since they are similar in meaning, however, he has difficulty in recod- 
ing them to their proper lists. He is likely, therefore, simply to guess one 
or the other, and there is a 50 per cent chance of making a mistake. 

This matter of the relationship between recodings available to the sub- 
ject and response relationships in negative transfer still needs a thorough 
experimental exploration. 

A comparison of proaction and retroaction. Several investigators have 
asked the question, Is retroactive inhibition greater than the proactive inhi- 
bition of retention when the same materials are used to test both? This is 
important, because we wish to know whether these two situations are meas- 
ures of the same psychological effect or whether one involves something 
more than the other. 

One classical experiment on this comparison (Melton and von Lackum, 
1941) used nonsense syllables and the technique of serial anticipation. Un- 
der these conditions retroactive inhibition was greater than proactive inhi- 
bition of retention. That is to say, an interpolated task has more inhibiting 
effect on the retention of an original task than an original task has on the 
retention of a task learned second. This finding will not violate the layman’s 
prejudices, since in the proactive experiment the more recent task is meas- 
ured for retention, and in the retroactive experiment the earlier task is the 
one measured. 

If, however, a time interval is allowed between original learning and the 
test for retention, the difference between proaction and retroaction disap- 
pears (Underwood, 1948). This again, perhaps, will not surprise the lay- 
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man, since the order difference between the tasks in proaction and retroac- 
tion will be slight compared with the time difference of a long retention 
period—if there is a long enough time lapse, the order of the tasks becomes 
unimportant. Thus, while there is a difference between retroactive and pro- 
active inhibition when the retention test is immediate or follows soon after 
original learning, the difference disappears if the retention test is delayed. 
This effect is largely the result of recovery from retroactive inhibition rather 
than increase in proactive inhibition (Underwood, 1948), though some in- 
crease in proactive inhibition probably does occur after a long retention 
interval (Duncan and Underwood, 1953). 

Proaction, retroaction, and degree of learning. Tf we compare the ef- 
fects of learning and retention of a second task after differing amounts of 
Practice on the first task, we find that both positive transfer and proactive 
inhibition of retention increase as a function of the degree of prior learning 
(Atwater, 1953). That is to say, the degree to which the acquisition of a 
new task is aided by previous learning depends upon the amount of practice 
On the original task (such aid is probably largely “learning how to learn”), 
but by the same token, the greater the learning of a prior task, the more 
it will interfere with retention of the second task. 

It is not clear, however, in what way amount of practice on the prelimi- 
nary task and on the second task interact. We do know that “overlearning” 
Of a single task makes it more resistant to ordinary forgetting. If the pre- 
liminary task is “overlearned,” the responses in it may be so thoroughly 
Associated with one another that the learning of a new set of responses 
Will not interfere with them. 


In the retroactive design increa: i ) 
More complicated effect, at least in rote verbal learning. With a fixed num- 


(nonsense syllables), an increase in prac- 


sed practice on an interpolated task has a 


ber Of trials on original learning L 
tice on the interpolated task first results in an increase in retroactive inhibi- 


tion. After the interpolated task has been practiced to a much higher level 
than the original task, however, further practice on the interpolated task 
Seems to decrease the amount of retroactive inhibition (Melton and Irwin, 
1940; Thune and Underwood, 1943). Thus, it is possible that if the tasks 
are highly differentiated by overlearning either the original or the interpo- 
lated task, retroactive inhibition may be less, at least in an immediate test 
Of retention. 

There is still some doubt, 


be. For motor learning, at least one experi 
Allister, 1952) shows that the amount of retroactive interference is simply 


however, about how general this finding may 
ment (Lewis, Smith, and Me- 
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inversely related to criterion level on an interpolated task. Another experi- 
ment on rote verbal learning (Briggs, 1957) shows that retroactive inhibi- 
tion increased to an asymptotic limit with an increase in practice of the 
interpolated task, but did not decrease. This, however, may have been 
simply because practice on the interpolated task was not carried far enough. 

In general, we can conclude that inhibition (either proactive or retroac- 
tive) grows with an increasing amount of practice on the interfering task, 
but it is likely that there is a limit to this increase; it may even be that with 
very high amounts of practice on the interfering task, the inhibiting effects 
may decline slightly. 


Theories of Retroactive and Proactive Inhibition 


What factors are responsible for proactive and retroactive inhibition 
effects? If we regard the experiments on these effects as models of the ex- 
perimental production of forgetting, then a reply to this question would 
provide us with some of the factors responsible for the forgetting process. 
The most frequently given answer at the present time is that competition 
between responses is one reason for both proactive and retroactive inhibi- 
tion. Before we examine some of the implications of this notion, however, 
we shall examine one of the other possible theories of inhibition, one which 
has been conceived of as interruption, or disruption, of a memory trace. 

Interruption of a memory trace. The classical theory of retroactive in- 
hibition is usually known as the “perseveration theory,” and the essence of 
this is that after a series of responses has been practiced, the effects per- 
severate for a period of time (presumably in the central nervous system) 
and that any other activity will interfere with these effects. This notion, 
incidentally, has been advanced to account for the distribution of practice 
effect. 

As far as retroactive inhibition is concerned this idea is of limited use. 
For one thing, it implies that retroactive inhibition should occur most 
strongly when a test for retention of interpolated learning is given immedi- 
ately after interpolated learning, and indeed, it should not occur at all if the 
test is delayed too long. This is contrary to fact (Postman and Alper, 1946). 

Also against this notion as a general explanation of retroactive inhibition 
is the fact that it does not predict the rather important effects of similarity 
(both with respect to stimuli and responses) upon retroactive inhibition. 
Furthermore, it does not allow for an explanation of retroactive inhibition 
and retroactive facilitation by the same mechanisms. Finally, it must lead 
one to the conclusion that retroactive and proactive inhibition of retention 
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are essentially different phenomena, since the perseveration notion cannot 
explain the latter. 

These facts have led contemporary theorists to abandon perseveration as a 
general theory for retroactive inhibition. Nevertheless, there may be some 
facts about learning which demand a construct such as perseveration, or at 
any rate, suggest the role of interruption of the memory trace immediately 
after it is formed as a factor in forgetting. For one thing there is the existence 
Of the immediate memory span. Why is our memory span for unrelated items 
limited to about seven items? Why cannot we hear a long string of numbers 
and then be able to recite them as well as we can when we split the string 
in two and recite the two halves separately? The fact that we cannot sug- 
gests that the later items we hear somehow interfere with our memory for 
the first items before we get a chance to emit or recode them. 

Hebb (1949) has suggested that there are two phases to the neural trace 
of Something learned, dynamic and permanent. The dynamic phase lasts 
but a short time, and, Hebb argues, it depends upon something like active 
recurrent circuits in the nervous system. If anything acts to interfere with 
the event to be remembered before it is transformed into a more or less 
Static memory trace, it is disrupted. চি 

Such an effect, however, probably has little to do with retroactive inhibi- 
tion as it ordinarily occurs. The “traces” established in a retroactive inhibi- 
tion experiment are durable and exist much longer than the few seconds in 
immediate memory span. Therefore the notions about consolidation or per- 
Severation of traces are not useful as explanatory constructs for the retroac- 
tive and proactive inhibition effects. They may be useful in explaining some 
Of the fundamental facts of learning, but thus far there has been little inter- 
est in theories of learning built on the notion of consolidation of traces. 

Competition theory. The most specific of the several related concepts 
that have been advanced to account for retroactive and proactive inhibition 


is the hypothesis of response competition. This asserts that there will be 


Mutual interference between two sets of responses to the extent that the 


Tesponses from the two sets compete with one another at recall. The extent 
to which one set of responses will compete with another depends upon the 


relative associative strength of the sets of responses to common stimuli and 


the inability of the individual to recode the responses in such a way as to 
Bilve them distinctive contexts. { 

. The effects of competition may be expressed in several ways: (1) The 
individual may be able to recall both sets of responses but may not be able 
to recognize which one is correct. (2) The individual may be able to recall 
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only one of the alternative responses and not the other. Competition here 
may be indicated either by (a) the recall of the incorrect response or (b) 
a longer reaction time for the recall of the correct response. (3) Finally, 
competition may depress all alternative responses, so that for everything 
except probing by a forced-choice method, the individual may not be 
able to recall either set of responses. Indeed both sets may be depressed 
enough for a third “irrelevant” set of responses to occur. 

Competition can occur only in a situation in which one of the alternative 
sets of responses is correct. If either one of the two alternative sets will do 
in response to a particular situation, competition, by definition, does not 
occur. Furthermore, competition will not occur when the individual can 
recall both sets of responses and both are acceptable. Neither of these condi- 
tions usually arises in formal tests of retention. For example, in answer to 
a completion item such as “the receptors for color vision are the = ॥ 
an instructor would accept only “cones,” not “rods or cones.” 

Implications of the competition hypothesis. The competition hypothe- 
Sis, even in the bare outline presented above, has several rather specific 
implications. For one thing it implies that the degree of competition between 
habits will depend primarily upon the associative strength of the habits for 
identical or related stimuli. In other words, the stronger a competing re- 
sponse, the more it will compete. Furthermore, if associative strength be- 
tween the common stimuli and the two sets of responses is equal, then 
temporal order should have no effect. This means, all other things being 
the same, that there should be no difference between retroactive and pro- 
active inhibition of retention. Likewise, there should be no difference in 
amount of retroactive inhibition produced by variation in the temporal 
position of interpolated learning. 

Another set of implications of the competition hypothesis is that both 
Proactive and retroactive inhibition should be determined by the similarity 
relationships between tasks. For example, if subjects learn two distinct sets 
of responses to stimuli, competition at recall will be greater if the stimuli with 
which the responses have been associated are identical rather than similar. 
In other words, something like stimulus generalization will operate. 

The competition hypothesis does not have much to say about response 
similarity, however, except that if responses are so imperceptibly different 
that either response will be accepted as correct, then positive rather than 
negative transfer will occur. Once the responses are distinct from one 
another, however, the role of degree of response similarity in the competi- 
tion hypothesis is not clear, since recoding is possible. Perhaps this is just 
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as well, for we have seen that the experimental evidence on negative trans- 
fer and response similarity is not unequivocal. 

On a number of these points the data are not really in disagreement with 
the implications of the competition hypothesis. To be sure, in a test of 
immediate recall there is likely to be a difference between the retroactive 
and the proactive inhibition of retention, but this disappears in a delayed 
recall test. The difference on initial recall may have to do with subtle 
factors of set and warming up. Certainly, in the main, the transfer surface 
is in accord with the implications of the competition hypothesis. The data 
On the temporal point of interpolation in retroactive inhibition are contra- 
dictory and confused, which is perhaps exactly what one would expect from 
the competition hypothesis. 

Since competition is determined by the associative strength of the com- 
Peting activity, we should expect that retention in retroactive inhibition 
Would vary inversely with the amount of interpolated learning and that re- 
tention in proactive inhibition would Vary inversely with the amount of learn- 
ing on the Original task. Again, this appears to be the case, with the possible 
exception of high degrees of interpolated learning in retroactive inhibition. 
The slight drop in retroactive inhibition that occurs in such a situation may 
be the result of differentiation of the two response systems through efficient 
recoding. 

Direct transfer and the competition hypothesis. Perhaps more impor- 
tant than these indirect implications of the competition hypothesis are direct 
indications of overt negative transfer. Direct expression of competition 
Can occur in two ways: (1) the reaction time for correct responses may be 
Increased by the need to suppress incorrect alternatives at recall, and (2) 
OVert intrusions of responses from the competing set of responses may actu- 
ally displace correct responses in recall. 

Both these effects occur. Postman and Kaplan (1947) have observed that 
Teaction time during relearning of the original responses is increased as the 
Tesult of interpolated learning. From this increase it is easy to infer that the 
Correct responses are inhibited for a time by an occasional tendency to say 
the Appropriate word from interpolated learning. Everyone has had the 
experience of not quite being able to recall something without collecting 
his thoughts for a moment, and this generally means picking the correct 
Tesponse from a number of alternatives. 

Of critical importance to the competition theory has been the observation 
Of direct intrusion of responses from the competing set of habits. Ordinarily, 
In a rote learning experiment, most of the “errors” are simply failures of 


264 THE PSYCHOLOGY OF LEARNING 


the subject to respond within the assigned time limit. Sometimes, however, 
subjects will respond with inappropriate responses, and these have provided 
a direct index of competition. The very fact that such intrusions occur is 
direct and conclusive evidence that competition is responsible for at least 
part of interference with recall. 

Melton and Irwin (1940) noticed that the frequency of intrusions of 
interpolated responses during the recall of an original task was greatest 
when the amount of practice on the original and interpolated tasks was 
about the same. If there was more practice on the interpolated task, in- 
terpolated responses tended to intrude less frequently during recall of the 
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Figure 51. Retroactive inhibition as a function of the number of trials of interpolated learning. 
The dashed lower curve shows the inhibition attributable to overt intrusion at recall. The upper 
dashed curve shows inhibition attributable to factor X. (Melton and Irwin, 1940.) 


Original material. This again is very likely the result of efficient recoding 
which leads to clear differentiation of the two sets of responses. Unfor- 
tunately, total retroactive inhibition continued to increase beyond the point 
at which intrusions of interpolated responses began to decrease. This situa- 
tion is illustrated in Figure 51; the curve labeled “RI attributable to factor 
X? is that portion of the total retroactive inhibition that could not reason- 
ably be accounted for by intrusions from interpolated learning interfering 
with the recall of original responses. The existence of this factor led Melton 
to a sweeping revision of the competition theory, and the resulting revision is 
usually called the “two-factor theory” (note: this two-factor theory is not the 
same as the two-factor theory of conditioning we considered in early 
chapters). 
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The two-factor theory. The competition of responses is clearly one 
factor in retroactive inhibition. Perhaps, however, other factors are at work 
in retroactive inhibition, and possibly also in proactive inhibition. One of 
these seems to be specific to retroactive inhibition. As a matter of fact, a 
difference between retroactive and proactive inhibition of retention in im- 
mediate recall has been advanced as one of the reasons why a second factor 
in retroactive inhibition is needed. Melton (Melton and Irwin, 1940; Melton 
and von Lackum, 1941) has suggested that this second factor is the “un- 
learning” of original responses during interpolated learning. 

Tf interpolated responses intrude at recall of original learning in retroac- 
tive inhibition, it seems reasonable to suppose that there would be also 
intrusions of original responses during interpolated learning. And, indeed, 
there are. When responses from original learning occur during interpolated 
learning they are incorrect, and hence, it may be argued, are weakened. 
Thus interpolated learning has the double function of (1) building up a set 
of competing associations and (2) weakening the set of original associa- 
tions. In proactive inhibition of retention only competition can work, because 
the competing responses are learned first, and thus the correct responses 
cannot be “unlearned” because of practice on the competing responses. 

It is extremely difficult to obtain any direct evidence for unlearning, be- 
Cause we do not fully understand how it works. Many experiments, how- 
ever, give indirect evidence for the existence of unlearning (Melton, cited 
above; Underwood, 1945, 1948). More recently, Briggs (1954, 1957) has 
charted the course of responses appropriate to original learning and those 
Appropriate to interpolated learning by means of specially prepared recall 
tests inserted at certain points during learning. These tests show that the 
frequency of original responses declines as interpolated learning is con- 
tinued. In other words original learning is weakened. Furthermore, these 
Tesponses “spontaneously” recover if a time interval is allowed between 
‘nterpolated Jearning and a test for retention of the original responses. 

This last Observation, which has been frequently made (Underwood, 
1948, etc.), has led to the view that the unlearning of original responses 
during interpolated learning is much like the process of extinction. This 
Comparison explains nothing, but it reminds us that unlearning, like extinc- 
tion, isin part only temporary in nature. Furthermore, again like extinction, 
Just a little relearning can cause all traces of unlearning to disappear. 

Thus, it seems likely that the retroactive inhibition effect has at least two 
Causes, (1) unlearning of the original responses during the learning of the 
Interpolated material, and (2) competition between responses at recall. 
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Proactive inhibition involves only the second factor. The relative im- 
portance of the two factors can be assessed by the fact that there is not 
much difference between retroactive and proactive inhibition of retention 
when conditions are otherwise the same and little or no difference when an 
interval of time is allowed to elapse. Thus competition at recall would seem 
to be the overwhelmingly more important of the two factors. 

The generality of competition. Looking back at Figure 47 near the 
beginning of this chapter we see that Ebbinghaus’s retention curve for non- 
sense syllables fell off very rapidly after original learning. Twenty-four 
hours after learning, Ebbinghaus showed a savings of about 35 per cent 
in relearning the same list. The general picture we get from textbooks on 
the psychology of learning is that such a result is fairly typical of the for- 
getting of nonsense syllables. Yet, more than one modern investigator has 
been puzzled to discover, by using college students as subjects (who are 
much less practiced at learning nonsense syllables than Ebbinghaus was), 
that their retention is much better after a comparable time interval. The 
students in the author’s course in experimental psychology, for example, 
typically show 75 to 90 per cent savings after 24 hours. What accounts for 
this difference? 

Earlier we pointed out that there were two basic designs for the study 
of retention. In one, the same subjects learned a different list of nonsense 
syllables for each retention interval; in the other, different subjects learned 
the same list of nonsense syllables. This accounts for the difference between 
investigations like Ebbinghaus’s which show low retention after a time 
interval and other studies which show relatively high retention. 

Underwood (1957) has shown how this difference is of fundamental 
importance for a general interference theory of forgetting. He demonstrates 
that the amount of forgetting of any one set of material is an increasing 
function of the amount of similar material the subjects have learned in the 
past. The curve in Figure 52 shows this relationship. Underwood obtained 
this figure by plotting the data from a large number of investigators; it shows 
very clearly that the per cent frequency of recall is inversely related to the 
number of previous lists the subjects have learned. 

The importance of this point is twofold. First of all, it clearly shows that 
the ordinary retention curves, in which no experimental attempt is made 
to produce forgetting, are determined by the same variables in the proac- 
tive and retroactive inhibition experiments. This fact places the competition, 
or more generally the interference theory of forgetting, on a much firmer 
foothold. 
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The second point is even more important. Underwood reminds us that 
most of the emphasis in the experimental production of forgetting has been 
On the retroactive inhibition design. But these data on the influence of 
previously learned material upon the retention of more recently learned 
tasks suggest that the proactive inhibition of retention is really the more 
important design in the experimental study of forgetting. 

To be sure, if the two-factor theory of retroactive inhibition is correct, 
retroactive inhibition involves a second factor, unlearning, not present in 
Proactive inhibition. We have seen, however, that the unlearning factor is 
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Much Jess important than the competition, or general interference, factor, 
and furthermore it appears to disappear to a considerable extent with rest. 

Therefore, what produces forgetting is, to a considerable extent, simply 
Mutua] competition among things we have learned. Fortunately, most of us 
Are safe from the possibility of a complete collapse of our associative proc- 
°sses by the fact that we (1) are continually relearning the things we need 
MM our daily life and (2) we have devised recoding devices for keeping im- 
Portant items free from competition. 

Thus I may glance briefly at a set of notes before I go into the classroom 
to deliver a lecture. This serves to reactivate the coding devices by which 

Ave stored many chunks of information. Just a word or two will enable 
Me to recy] a rather complicated theory, and I find as I talk that stories 


(alas, too familiar) and anecdotes flow automatically. 
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Direct competition and indirect interference. The method of paired 
associates provides the ideal model for the competition theory of forgetting. 
In the classical paired-associate experiment the subject learns two sets of 
responses to identical stimuli. When the stimuli are presented at recall 
there is competition between the two sets of responses. In actual practice, 
in an experiment on retroactive inhibition, competition produced by the 
alternate set of responses can account for nearly 70 per cent of the errors 
made by subjects at recall (Deese and Hardman, 1954). 

Unfortunately such direct evidence for competition does not always 
occur in experiments on serial learning by the method of anticipation. If 
subjects are asked to learn two lists of nonsense syllables by the method of 
anticipation, retroactive ‘inhibition occurs when recall of the first list is 
asked for; the second list reduces the tendency to recall the first. Direct 
competition from the first list, however, accounts for a negligible number 
of overt errors at recall (about 5 per cent) when recall for the first list 
comes right after the learning of the second one. What happens in this case 
is that the learning of the second list causes the subjects to confuse the 
proper location of items within the first (Deese and Hardman, 1954). Thus, 
while the second list does produce retroactive inhibition, it does so by pro- 
ducing confusion within the original list rather than by directly competing 
with it. 

Thus direct competition after interpolated learning is disappointingly 
small in serial learning. Apparently, what happens in this case is that the 
rather weak organization within the original list is interfered with. The 
contextual boundaries between lists, probably established by some primitive 
coding, are strong enough so that the two lists do not directly interfere with 
one another. These contextual boundaries are not strong enough, however, 
to remain intact with the passage of time. If time intervals of a few hours 
or a few days are interposed between interpolated learning and recall of 
Original learning with serial lists, there is competition at recall from the 
interpolated list (Deese and Marder, 1957). Thus direct competition occurs 
if a time interval elapses. The time interval probably weakens the contexts 
which held the lists apart; we should like to think that this weakening of 
context is also by interference, but we have no direct evidence for this. 

The point to be made is that interference may not always be exhibited 
in the form of direct competition at the time of recall. These are not 
grounds, however, for discarding the notion of competition in such cases. 
We pointed out earlier that competition may depress all “relevant” re- 
sponses so that irrelevant responses can take over. Thus, the addition of a 
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second list to the first does not have the effect of producing competition 
directly, but this addition at fairly low strength (because of the contextual 
boundaries between the lists) does depress the associations from the first 
list sufficiently so that their correct order is not maintained. 

In conclusion we can say that direct competition is an important de- 
terminer of the efficiency of recall. In addition, competition probably affects 
recall indirectly by competing with the correct responses so that other, 
irrelevant responses can displace them. Probably the conditions most 
directly affecting the extent to which competition takes place are similarity 
among the stimulus conditions associated with particular responses and the 
degree to which these responses are assimilated (recoded) into the indi- 
Vidual’s past history of learning. 


Repression 


The work of Sigmund Freud has been enormously influential in nearly 
all of the social sciences. The psychology of learning is no exception. The 
Concept of anxiety as a drive, the use of the technique of free association 
for Probing emotional processes, both have their origin directly or indirectly 
in the theories of Freud. Since Freud touched upon nearly all dynamic 
Processes in human behavior, he had something to say about forgetting. In 
this section we shall consider one of the most important of the possible 
forgetting mechanisms suggested by the work of Freud and his followers. 


The Concept of Repression 

Freud (1925) pointed out that one of the most effective methods an indi- 
Vidual can use to protect himself from the conflict between powerful in- 
Stinctive urges and the restrictive demands of the ego is through the 
Tepression of these urges and everything connected with them. This means, 
first, that the existence of these urges is not allowed to intrude into the in- 
dividuaps conscious life, and secondly, that all mental events associated with 
these urges are pushed out of consciousness. Thus repression has two 
forms. In the most fundamental sense repression refers to the blocking of 
Motivation. In a secondary way, however, it refers to an expulsion of certain 
associations from the conscious life of the individual. Repression requires a 
Constant expenditure of psychic energy, since it must hold down a drive, 
Which itself is expending psychic energy. Thus repression is an active 
Process. 

The fact that repression is active does not mean, however, that the indi- 
vidual consciously tries to keep threatening ideas or wishes out of his mental 
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life. On the contrary, such ideas would have to be admitted before they are 
suppressed. Freud is arguing that they are not allowed to come into con- 
sciousness in the first place. Thus, repression is a dynamic conflict at an 
unconscious level. 

For our purposes, one of the really fundamental aspects of the concept 
is that anything that is repressed is not lost. It is only submerged below the 
level at which, under ordinary circumstances, the individual can volun- 
tarily recall it. Furthermore, it does not take relearning to reactivate this 
material. If the source of conflict is removed, the repressed memory will 
recover without specific practice. It is this feature of potential recovery that 
is of critical importance to the role of repression in forgetting. 


Experimental Evidence for Repression 


Repression is a concept designed to explain certain manifestations of 
personality, certain kinds of amnesias, the course of an individual’s behavior 
during psychoanalysis, and certain effects under hypnosis. These things, 
however, only suggest what the role of repression is for associative proc- 
esses. Can ordinary associations be repressed? The only way to answer this 
question is to attempt to produce a process similar to repression in the 
laboratory. 

Unfortunately, the experimental evidence on repression is weak. This is 
partly because repression is not a well-defined concept (we may get involved 
in an endless quarrel over the meaning of the “ego,” for example) and 
partly because experiments on the production of emotional reactivity, which 
is necessary to repression, are difficult to perform and interpret. The essen- 
tial requirements for an experimental demonstration of repression are (1) 
the establishment of equal degrees of learning of some material in a control 
and an experimental group of subjects, (2) the imposition of some ex- 
perience likely to produce repression in the experimental group, (3) indica- 
tion of repression through forgetting, and (4) recovery from repression 
after the threat imposed in (2) is removed. 

Several experiments meet these requirements. For example, Zeller (1950, 
1951) managed to associate the retention of nonsense syllables with the 
threat of personal failure. Under these conditions subjects failed either to 
recall or relearn the syllables as well as a control group of subjects. When 
the threat was removed, recovery of memory occurred. A similar experi- 
ment by Aborn (1953), which used somewhat different learning materials 


and the measurement of “incidental” memory, showed essentially the same 
results. 
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These experiments leave many questions unanswered. From the stand- 
point of the classical theory of repression we may wonder about the role of 
consciousness. Presumably the subjects in these experiments were motivated 
to recall the material when under the repression conditions but were simply 
unable to do so. This is probably the only thing we can say, and in this 
sense these experiments fall short of psychoanalytic theory (though per- 
haps the role of consciousness in the theory needs examination). More 
important questions for our purposes concern the nature of the associations 
themselves under repression. What happens to the organization of the ma- 
terial? What is the role of associative strength? Can very strong responses 
or highly coded systems of responses be repressed? Is it possible for us to 
repress, say, the names of our parents or grandparents? These are all ques- 
tions which cannot be answered by the experimental data we have, and 
Perhaps they are beyond any of the experimental techniques available to 
us. Nevertheless, such matters need to be settled before the concept of 
repression can take the important role it deserves in the psychology of 
learning. 


In Conclusion 


fp We have seen that interaction between responses learned at different 
times is an important determiner of forgetting. We may well ask, however, 
Whether or not forgetting ever really occurs. If two responses compete at 
ne particular time, the stronger one will win out, but in order for competi- 
tion to occur, both must be there. Are responses then ever really elimi- 
nated? There is, of course, no way to answer So general a question. Never- 
theless we can say that the residual effects of something presented in the 
Past can be detected after long intervals of time and under very improbable 
Circumstances. Burtt (1941) describes an experiment with his son, in 
Which he read the boy passages, in Greek, from Sophocles’ Oedipus 
Tyrannus. There were three selections of 20 lines each, and for a period of 
time Burtt read these selections to the boy every day. The child was less 
than two years old. Six years later, he was required to learn by rote these 
selections plus some new ones from the same source. It took the boy an 
Average of 435 repetitions to learn the new selections and only 317 repeti- 
tions to learn the old ones. Thus, there was savings of better than 25 per 
Cent, 

Furthermore, we need to point out that the mechanisms of forgetting 
discussed in this chapter are not necessarily the only ones. Forgetting is a 
complex phenomenon, and there are undoubtedly many subtle features of 
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it that we have not yet begun to explore. We are just beginning to under- 
stand the important role of recoding in memory, both in retaining and 
altering things as they are retained. We need to know much more about how 
competition and interference interact with recoding. It is probable that once 
the material is in recoded form, it is relatively resistant to competition, but 
it may suffer considerably before it is thoroughly recoded. In addition, of 
course, in its recoded form it may be at the mercy of changes in motivation 
and attitude. 

In short, our past experience is being continuously altered by new ex- 
periences and changes in our interests and attitudes. Thus memory is neither 
a static thing nor the gradual decay of what was once learned. Our past is re- 
shaped for us by what we do now as much as what we do now is directed 
by our past. 


CHAPTER 11 


PROBLEM SOLVING AND THINKING 


We regard the ability to think as the supreme accomplishment of the evolu- 
tion of mind. That we should feel this way is understandable, for thinking 
1s the thing in which we human beings undeniably excel. The very fact 
that we characterize human thinking as a product of the evolution of mind, 
however, suggests that it has its antecedents in animal mentality. Further- 
More, the actual process of thinking and solving problems is learned, and 
We have already seen that many basic principles of learning are common 
to man and to higher animals. Consequently, thinking and its analogue in 
animal behavior must have an intimate connection with the basic principles 
of learning. In this chapter we shall attempt to outline some of these con- 
nections. 


PROBLEM SOLVING IN ANIMALS 


In the 75 years or so that psychologists have been examining animal 
behavior in the laboratory, an enormous number of examples of the ability 
Of animals to solve problems has accumulated. Not all of these will help 
lie to understand the nature of problem solving, but a few important ones 
Will Show how theories of problem solving have arisen. 

A problem exists when an organism is motivated to achieve some goal and 
there exists no ready means to reach it. Problem solving is the process by 
Which the animal achieves the goal. Since some examples of problem solving 
Merely illustrate the basic principles of instrumental conditioning, problem 
Solving extends down to the most elementary examples of learned instru- 
Mental behavior, Other examples of problem solving are quite complicated, 
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however, and the processes involved are evidently more elaborate than 
those in simple instrumental conditioning. 


Two Classical Viewpoints on Problem Solving 


Thorndike’s Trial-and-error Problem Solving 


The term trial and error applied to the process of problem solving was 
introduced by C. L. Morgan (1894), but its importance comes from some 
well-known experiments by Thorndike (1898). Thorndike reported exten- 
sive observations on the way in which animals, mostly cats, learned to 
escape from puzzle boxes. 

These puzzle boxes were simply enclosures from which the animal could 
learn to escape by turning a door latch, pulling a string, or pressing on a 
treadle attached to the door. Almost every student of psychology knows 
that Thorndike’s cats were very slow and ineffective in solving this problem. 

Like most cats, these evidently did not relish being penned in, and they 
would attempt to escape almost as soon as they were placed in the box. 
These attempts, however, were neither systematic nor fruitful. After vari- 
able amounts of time spent thrashing around, the cats would, quite acci- 
dentally as a rule, happen upon the device for opening the door. Usually, 
as soon as the door was open, the cats could leave. When they were re- 
turned to the box, however, they usually began their futile direct escape 
attempts all over again. It generally took quite a few trials for them to learn 
to turn directly to the device that opened the door, and the learning was 
usually an irregular, hit-or-miss affair. 

These cats did not appear to be particularly intelligent. They showed 
little ability to grasp and retain the principle of the puzzle box after having 
once happened upon it. Thorndike tested a number of other kinds of ani- 
mals, and while, in general, monkeys and dogs did somewhat better than the 
cats, he was not greatly impressed with the need for supposing that any very 
high level of mental activity took place in his animal subjects. 

These experiments on the puzzle box do not differ in principle or much 
in detail from the experiments on the way rats learn to press levers in 
Skinner boxes. As a matter of fact, Thorndike’s experiments led him to 
state the principle of effect as the basic mechanism in learning. The principle 
of effect refers to what we described in earlier chapters as the principle of 
reinforcement in instrumental learning. He pointed out that the essential 
element in the learning of instrumental behavior is the association of some 
response with a reinforcement. Gradually, a response so reinforced gains 
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prepotency over other responses and comes to dominate the behavior of the 
animal in the testing situation. The behavior of animals is blindly de- 
termined by the circumstances of reinforcement. There is little in animal 
behavior to suggest the need for more complicated processes, or at least 
these seemed to be the implications of Thorndike’s work. 

Of course many investigators of animal behavior resented the implica- 
tions of the pure trial-and-error description of animal problem solving. 
Both before and after Thorndike's experiments, numerous investigators pre- 
sented evidence for what they believed to be far more complicated proc- 
esses in animal learning and problem solving. 

The notion that Thorndike was grossly in error and that problem solving 
in animals is a much more complicated and intelligent process than he sup- 
Posed finally came to center around the work of W. Kohler on the intelli- 
gence of anthropoid apes. We shall briefly review Kohler’s work and the 
implications that have been drawn from it. 


Kohler’s Insightful Problem Solving 


Kohler (1925) studied in great detail the problem-solving behavior of 
anthropoid apes, particularly chimpanzees. He was critical both of Thorn- 
dike’s results and methods, and he designed problems for animals which he 
felt would more adequately probe their mental processes. 

Kohler studied the ability of his animals to solve a wide variety of instru- 
mental problems. These were always designed, in contrast to Thorndike's, 
SO that all the elements necessary for solution could be perceived by the 
animals. There were no trick doors or hidden mechanisms that delivered 
food. For example, in some of Kohler’s best-known observations, he simply 
Suspended the animal’s daily ration of food from the roof of the animal's 
Cage. A box was placed some distance from where the food was suspended. 
‘The Problem for the animal was to perceive the relation between the box 
And the location of the food and to act accordingly. 

In one case, Kohler reports, an animal spent a fruitless day trying to 
Scramble up to the food by various means. When Kohler demonstrated the 
Solution to the animal, however, by placing the box under the food and 
touching the food while standing on the box, the animal immediately got the 
Solution. There was no trial and error, no contingency between responses 
And reinforcement. In other examples of the box problem in which the ani- 
Mals more or less arrived at their own solutions, Kohler thought that the 


animals showed clear evidence of foresightfully planning behavior in ad- 
Vance. 
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Such planning is quite a different matter from trial and error. Since 
Kohler believed that his animals did solve the problems by foresightful plan- 
ning, he characterized their solutions as insightful. By this he meant that ani- 
mals did not accidentally happen upon a solution; instead, they saw the rela- 
tionship between the elements of a problem in such a way that they arrived 
at an effective solution before they actually engaged in behavior which 
demonstrated solution. 

‘This means in effect that the animals tried out hypotheses based upon 
the perceived relationships in the environment. Sometimes, of course, the 
hypotheses were incorrect, but even an incorrect hypothesis, Kohler argued, 
is not the same thing as blind, unplanned trial-and-error behavior. 

How do we know when insight occurs? It is necessarily an inference 
from observed behavior. The characteristics of problem solution that led 
Kohler and other psychologists to identify solutions as insightful are many. 
Perhaps the most basic is the suddenness of the solution. This is well illus- 
trated in the example given below. In addition, insight is inferred when the 
solution (if it is correct) persists. Remember that trial-and-error solutions 
are characterized by irregularity. The animal may take longer to arrive at 
the solution on the third trial than it did on the second. This is generally 
not true for insightful solutions. Once the animal gets the solution, it can 
demonstrate it immediately upon being exposed to the problem. Finally, 
Kohler claimed, it is possible to infer from the behavior of the animals that 
they were able to put together objects in the environment that were separate 
and unrelated. These characteristics of insightful solution are illustrated 
in an example which is taken from Yerkes (1943). 

An example of insightful solution. Yerkes placed an ape in a room with 
an oblong box that was open at both ends. The box was about 170 centi- 
meters long and very narrow. There was a small door in the top of the box 
at the middle. This door could be opened by the experimenter, and a banana 
could be locked inside while the ape watched. A pole was also placed in 
the room; the pole was just the length of the box. Thus, the ape could get 
the banana by pushing it out one end of the box with the pole. 

Initially, the ape exerted much effort trying to pry open the latch on the 
door in the middle of the box. Gradually it lost interest in this endeavor 
and began to show signs of frustration. It even attempted to induce the 
experimenter to obtain the banana. At one point, however, the animal 
solved the problem. Its solution came when it began playing with the 
pole and rolling it around the room. The pole rolled into position alongside 
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the box. The ape reached into the box with its arm in a vain attempt to 
reach the banana. It then ran to the opposite end of the box, where it again 
looked in. Then, suddenly, the ape turned to the pole and instantly pushed 
it through the box and then ran around to the other side to obtain the 
banana. 

How can this behavior be explained away as an example of instrumental 
conditioning? Indeed, it is difficult to do so. The ape appeared to see very 
readily the relationship between the box and the stick. This, by no stretch 
Of the imagination, is similar to the rat's perceiving the relation between 
Pressing the lever and obtaining food in the Skinner box. 

It is another matter, however, to say that one therefore needs an entirely 
new principle of behavior to account for the observations on insightful 
Problem solving. Such a need diminishes further when it becomes apparent 
that there is a more or less continuous gradation between trial-and-error 
Solution and solution by insight. Furthermore, some observations indicate 
that a background of simple instrumental learning is necessary for the oc- 
currence of insight. In other words, insight depends upon and grows out of 
instrumental learning and trial-and-error behavior. 


Bridging the Gap between Insight and Trial and Error 


The background of insight. Some investigators, not merely content with 
characterizing problem solving in the higher animals as sometimes insight- 
ful, have tried to find the elements in the background of experience neces- 
Sary for the occurrence of insight. 

For example, Birch (1945) presented some chimpanzees with the prob- 
lem illustrated in Figure 53. As we can see from this drawing, the food was 
Placed within the blade of a hoe, and all that was required of the animai 
Was that it rake the food in. Four out of six animals failed to solve the prob- 
lem within the 30 minutes allowed. One animal succeeded after it acci- 
dentally moved the food into the cage. This, of course, is pure trial and 
error. Another animal succeeded in what appeared to the investigator as 
an insightful solution. Thus, despite the simplicity of the problem and the 
Seneral intellectual superiority of chimpanzees in the animal world, insight- 
ful solutions were not plentiful in this example. 

Birch went one step further, however. He provided his subjects with 
Sticks to play with. In the course of the stick play, the animals behaved in 
Such a way that Birch inferred that they were using the sticks as functional 
extensions of their arms. Such learning is probably well described by the 
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model of instrumental conditioning. The important point, however, is that 
after a certain amount of play with the sticks, these chimpanzees could 
solve quite complicated problems, including the hoe problem, which de- 
manded the use of sticks of various sorts. 

The implications of this study are clear. Insight depends upon a back- 
ground of simple instrumental habits. If the animal has not learned these, 
the probability of insight is extremely low; indeed, solutions to problems 
are by trial and error. In many instances the occurrence of insight is simply 
by generalization of lower-order habits. Thus, in some cases at least, in- 
sightful problem solving is simply the result of positive transfer. 

The extension of trial and error. What about Thorndike’s cats? Are 
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Figure 53. The hoe problem. The chimpanzee can rake in the food, which is out of arm's reach, 
simply.by‘pulling in thé hoe: (After Birch, 1945.) 


cats exceptionally stupid animals, or can they too solve problems by in- 
sight? This is not a very elegant way of putting the scientific problem, but the 
fact is that other experimenters have noted considerably more intelligent 
behavior in cats than did Thorndike. Even in the same situation, the trial- 
and-error behavior noticed by Thorndike was probably due to inexperience. 
Adams (1929) repeated Thorndike’s experiments and came to the conclu- 
sion that Thorndike’s cats, in many instances, were simply too excited and 
untamed to demonstrate anything more than random thrashing about. 
Insightful and purposeful solution of problems is not the exclusive prop- 
erty of primates. Indeed, the lowly laboratory rat can solve problems in- 
sightfully if conditions are right (Maier, 1929). Furthermore, pure trial- 
and-error behavior is not unknown among adult human beings. Despite 
the fact that people use the manipulatory symbolic function of language, 
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they still indulge in blind trial-and-error behavior when faced with an un- 
usual and difficult problem (Ruger, 1910). 

Bridging the gap. Harlow (1951) presents cogent arguments, some of 
Which are like those presented above, to explain why it is not necessary to 
postulate two basic and distinct processes in problem solving. Trial-and- 
error behavior is simply the establishment of elementary habits in the ab- 
sence of stimulus generalization, secondary reinforcement, etc., from 
previous learning. Such is the case in most experimental studies of animal 
behavior. The laboratory rat, for example, never in its life encounters any- 
thing like the lever in a Skinner box until it is the subject of an experiment 
on learning. Thorndike’s cats had never before been confined in boxes from 
Which it was possible to escape by pulling on strings. Therefore, the correct 
responses had to be built up from scratch. 

In other situations unfortunately all too rare in the psychological labora- 
tory, an organized sequence of training converts a naive animal into a rather 
Sophisticated one. In many different ways we have had occasion to refer to 
Harlow’s experiments in which he changed monkeys who made choices on 
Successive discrimination problems in a blind and mechanical fashion to 
monkeys who carefully formed and acted upon hypotheses. These monkeys 
behaved insightfully, and their behavior came entirely from a rigorous pro- 
gram of training. Their hypotheses were the result of “second-order” habits 
of Observing and testing consequences, and they arose because these are the 
common features of behavior reinforced in a number of lower-order habits. 

Does this mean that problem solving is simply transfer from more ele- 
mentary habits? It is unlikely, since if this were the case, the only thing to 
Prevent a rat from behaving as intelligently as one of the great apes would be 
the number and kind of elementary habits it could bring to bear on a given 
situation. There are genuine species differences in a number of important 
Psychological dimensions, and while we do not know all of them, some in- 
Vestigators have isolated a few that must be very important. Among these 
are the differences between animals in the capacity to react to stimuli in 
their absence and to react differently to the second occurrence of a stimulus 
than to the first. These indicate the ability of animals to react symbolically, 
and we shall examine next some problems in investigating symbolic be- 
havior in animals. 


Symbolic Behavior 


It is convenient to divide symbolic behavior into three classes—(1) sim- 
Ple discriminations or classical conditioned responses, (2) reactions to 
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stimuli which are not in the environment but exist within the organism or 
are mediated by learned cues, and (3) linguistic reactions in which the or- 
ganisms recode portions of the environment for action in the absence of 
the particular cues. It hardly need be said that, so far as we know, this third 
variety of symbolic behavior is limited to human beings. 

Simple discriminations serve to lead the organism from place to place. 
The formation of symbolic representations in this sense is nothing more than 
learning in its most basic aspect. The second function of symbolic behavior 
is to bridge a gap in the absence of external stimulation. This we can call 
representational symbolic behavior. The higher animals evidently can pro- 
duce their own symbols, so that they can respond appropriately even after 
the external stimulus has been removed. The third function, the linguistic 
function, enables organisms to manipulate the environment symbolically. 

It is the second function which is of basic importance to the comparative 
study of problem solving. If animals solve problems by testing hypotheses, 
it is essential that they be able to represent to themselves more than the 
environment present at the moment. To the extent that there are differences 
among species in this ability, there should also be differences in ability to 
solve problems insightfully. Consequently, many investigators have exam- 
ined animals at different positions in the phylogenetic scale for their ability 
to react to stimuli in their absence. Some evidence on this question is pre- 
sented below. 


The Delayed-reaction Experiments 


W. S. Hunter was one of those who realized the importance of examining 
symbolic action in the problem-solving behavior of animals. Hunter (1924) 
devised and examined several tests which he thought would force animals 
to react to stimuli in such a way that the only inference one could reason- 
ably make was that symbolic representation had occurred. Some investiga- 
tors have disagreed with Hunter in this matter, so it will be necessary for 
us to examine these tests in some detail. The first is the delayed-reaction test. 

There are two basic methods in this test, direct and indirect. In the in- 
direct method the investigator trains the animal to associate a certain stim- 
ulus with a reinforcement. For example, Hunter presented animals with 
three doorways, one of which was lighted. If the animal chose this one, it 
was reinforced. When Hunter was certain that this association had been es- 
tablished, he would turn the light on, let the animal observe the correct 
door, and then turn the light off while restraining the animal by means of 
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a glass barrier. The animal would be restrained for a period of time and 
then released. If the animal chose the correct door more frequently than 
chance, Hunter thought there was evidence that it could keep a symbolic 
representation of the light. 

In the direct method, pretraining is not necessary. Generally the direct 
method works with primates and higher animals. For example, a monkey 
may be presented with two cups. It watches the experimenter place a raisin 
under one of the cups. The monkey is restrained for a period of time and 
then allowed to choose one of the two cups. 

Hunter tested a variety of animals by the indirect method and found that 
there were marked differences from species to species of mammals. For ex- 
ample, in his original experiments (1912), he found that rats could delay 
a maximum of about 10 seconds, while dogs could delay up to 5 minutes. 
A systematic ordering of the animals Hunter tested indicated that he had 
a powerful technique for differentiating an important mental process among 
the higher animals. 

Not everyone was convinced that the delayed-reaction test permitted an 
unequivocal inference about symbolic representation. Even in Hunter's 
Observations, there were some indications that animals were using con- 
tinuing external stimulation to bridge the gap. In other words, they were 
using substitutes for direct symbolic representation. For example, Hunter 
Observed that his rats, when successful at delay, maintained the bodily ori- 
entation that they had at the time the light went off for the entire period of 
delay. Other animals, such as raccoons, did not need to maintain the cor- 
rect orientation in order to respond. Thus, it seemed likely that the raccoons 
Were more capable of a true symbolic representation than were the rats. 

Under other conditions, however, rats can delay up to four minutes 
Without a maintained postural orientation (McCord, 1939; MacCorquo- 
dale, 1947). Here, animals were prevented from responding to spatial cues 
When the correct stimulus was moved to another door. This procedure for 
Preventing response to spatial cues removes certain objections that some 
Psychologists have had to using the delayed response experiment as an indi- 
cator of symbolic processes (Seward, 1948). Nevertheless, it is possible to 
View delayed-reaction experiments of the sort we have been discussing as 
examples of complex discrimination learning (Seward, 1948). In either case, 
the solution of the problem represents a high level process in the animals 
and one that is probably closely linked with the differential ability to solve 
Problems. 
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The Double-alternation Problem 


Another test that has been generally considered to be something that 
could be solved only by symbolic representation is the double-alternation 
problem. In the double-alternation maze problem (Hunter, 1928) an ani- 
mal is required to learn a maze consisting of two alternative pathways. 
Each pathway describes a square and brings the animal back to the starting 
point. Figure 54 shows an example of such a temporal maze. The animal 
must learn to alternate the pathways in a sequence. In the double-alterna- 
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Figure 54. The floor plan of a temporal maze. In the dovuble-alternation problem the animal is 


required fo run around twice in one direction and then twice in the other direction before being 
rewarded. 


tion problem the animal is required to go twice to one side and then twice 
to the other side in order to receive the reward. The sequence can be ex- 
tended, so that instead of a series such as RRLL before reward, the animal 
must learn a sequence such as RRLLRRLL. 

The rat finds it all but impossible to solve even the simplest sequence 
(RRLL). Raccoons (Hunter, 1928), cats (Karn, 1938), and other animals 
can solve the double-alternation problem. The errors, incidentally, in solving 
such a problem are distributed among the alternatives in much the same 
way they are in the linear-maze problem (Stewart and Warren, 1957). 
Preverbal children find the double-alternation problem difficult, but chil- 
dren who can verbalize solve it easily (Hunter and Bartlett, 1948). 
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It is difficult to see how any animal could solve the double-alternation 
problem as it is seen in the temporal maze, without some central mechanisn 
that can tell the animal where it is in the sequence. The central mechanism 
needed is of a fairly low order, considered from the human point of view, 
since it involves, at the most, the ability to count to two. The double-alterna- 
tion problem, however, seems to be a more clear-cut case of symbolic be- 
havior in the absence of secondary supporting cues than does the delayed- 
reaction problem, since it is almost impossible to find grounds for external 
cues that could help an animal solve it. As Hunter points out, even kines- 
thetic cues must be ruled out in the temporal maze.’ 


The Role of Symbolic Representation 


There is something more to insightful problem solving than simply the 
cumulation of transfer. Some animals, high on the phylogenetic scale, per- 
form in such a way that it is almost impossible to avoid the inference that 
they are capable of symbolic representation. External stimuli become re- 
coded in some way so that these animals can react to cues in their absence 
Or react in such a way that they can keep separate the first and second 
Occurrences of a particular external cue. 

Symbolic representation can have many different functions in problem 
solving. For example, it probably plays an important role in multiple-choice 
Problems which require an organism to remember and reject the alternatives 
already tried and found wanting (see Miller and Frick, 1949). It must be 
fundamental in helping animals to see relationships among external events. 
In the example of insightful problem solving described a few pages back, 
Symbolic representation of the pole while the animal was peering into the 
oblong box would almost seem necessary to an insightful solution to the 
problem. 

Unfortunately, the implications of some of the basic properties of sym- 
bolic representation have never been fully explored. Physiological psycholo- 
Lists who are interested in investigating some of the localized functions of 
the central nervous system have made extensive use of the tests we have 
mentioned. Few investigators, however, have tried to follow some of the 
implications of these tests for problem solving generally. All we can do at 
this stage is to point to the tests themselves and to the differential ability 
among animals for problem solving. 

We know that there are marked species differences in the ability to trans- 


Although in the double alternation of lever pressing, which the rat can solve, 
kinesthetic cues could conceivably work (Schlosberg and Katz, 1943). 
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fer complex discrimination problems (Harlow, 1949; Cotterman, Meyer, 
and Wickens, 1956, etc.) . These probably also reflect underlying abilities to 
use symbolic representation. It is intriguing, though perhaps fruitless, to 
speculate on the nature of the differences in representational factors among 
different animals. Some investigators (Stewart and Warren, 1957) have 
suggested that in many tests of symbolic representation, the differences are 
quantitative rather than qualitative. In the normal adult human being, 
though, there are clear qualitative differences. 


HUMAN PROBLEM SOLVING 


We are sure that normal adult human beings can solve problems sym- 
bolically, because they possess highly developed and organized linguistic 
tools for doing so. Therefore, there is no question whatever about the nor- 
mal adult’s ability to solve problems insightfully. He may not always do so 
although he is capable, and, indeed, a knowledge of this capability is what 
leads us to suspect an analogous process among animals based upon a 
much more rudimentary symbolic process. 

For many years, Piaget and his associates have been studying the devel- 
opment of intellectual processes in children. These studies fill out in rich 
detail the growth of the processes of insight and concept formation in devel- 
oping children. A great deal has been learned about children’s concepts of 
number, space, time, and other fundamental matters from these studies. In 
addition, these investigators have studied the actual process of problem 
solving. For example, Inhelder and Piaget (1955) point out that young 
children solve problems by finding similarities and differences among 
things; their approach to problems is concrete. Adolescents, on the other 
hand, may develop formal propositions; they try to formulate or find ab- 
stract laws for solving whole classes of problems irrespective of their spe- 
cific content. Although it is on a different level and involves quite different 
processes, this description of progression from concrete to abstract prob- 
lem solving is highly reminiscent of the formation of learning sets in Har- 
low’s monkeys. 

An account of such factors, however, is more properly the subject mat- 
ter of developmental psychology. Our task at the moment is to find out what 
we can about the relationship between associative factors, learning, and the 
solution of problems. Therefore, we shall turn to the experimental literature 


aimed at the analysis of problem solving and the relationship between this 
process and mechanisms of learning. 
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Experimental Studies of Human Problem Solving 


The Formation of Hypotheses 


Hypotheses are not random. When a normal adult is faced with a prob- 
lem to solve, his attempts at solution are not random samples of the pos- 
sible behavior in which he might engage. For that matter, the cat in the 
puzzle box does not really behave randomly either. Certain classes of re- 
sponses have preference over others for both man and cat. The behavior 
of the cat is more likely to be characterized as trial-and-error behavior 
simply because, in all likelihood, it will be less systematic and more random 
than the ideal strategy for solving a particular problem. As we shall see 
shortly, men do not always or, indeed, frequently employ an ideal strategy, 
but their behavior is more likely to be systematic than the behavior of the 
cat or monkey. In this sense, man is less likely to engage in pure trial-and- 
error behavior than are any animals. 

There are many ways in which the behavior of men and that of animals 
differ qualitatively during problem solving. An animal, for example, is 
much more likely than the normal, unruffled adult to repeat over and over 
again a response which has already been tried and found not to lead to solu- 
tion. The range of possible attempts to solve a particular problem will, in 
general, be less in animals than in man. Finally, man, by his ability to use 
language, can much more effectively recode the problem into more familiar 
terms or he can exhaust systematically the possibilities for finding a solu- 
tion. 

| Thus, the hypotheses men use in solving problems are not random selec- 

tions of the ones they might use. They tend to be determined by rather sys- 
tematic factors. Cofer (1954) points out that some of the most important 
determining factors in human problem solving are mediating verbal re- 
Sponses.* These are responses that the individual is not likely to make 
Overtly; they are implicit verbal responses of the sort we usually mean 
When we talk about thinking. The important verbal mediating responses 
Which lead to attempted solutions of a problem may not be directly elicited 
by the problem situation but may be only indirectly elicited by virtue of 
Association with responses directly produced by the situation. In the next 
few pages we shall examine some of the ways in which verbal mediating 
Processes influence problem solving. 


2 2 Also see Osgood (1953) for a detailed account of the theoretical role of media- 
tional processes in learning and thought. 
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Verbal mediating responses and direction. The Way in which verbal 
mediating responses can give direction to hypotheses in problem solving is 
illustrated in an experiment by Judson and Cotfer (1956). In this study, 
subjects were given groups of four words and told to indicate the one word 
not related to the other three. Suppose, for example, that ADD, SUBTRACT, 
MULTIPLY, PER CENT were given. This item is easy. It is Obvious that PER 
CENT is the word that does not belong, since it is the one word that does 
not indicate an arithmetic operation. Suppose, however, that the words 
Were ADD, SUBTRACT, MULTIPLY, INCREASE. Here the answer is not so easy. 
To be sure, the same three arithmetic Operations are present and INCREASE 
is not an arithmetic operation. But ADD, MULTIPLY, INCREASE all belong 
to a class of words indicating increasing magnitude, while SUBTRACT 
belongs to a class of words indicating decreasing magnitude. Perhaps SUB- 
TRACT is the word that does not belong. 

Judson and Cofer administered many such ambiguous items to subjects. 
The object was to find out which ambiguous word the Subjects would accept 
as belonging with the other words. One of the things they discovered was that 
the order of the words was important. Thus SUBTRACT, INCREASE, MULTIPLY, 
ADD would more likely lead to INCREASE being rejected, whereas MULTIPLY, 
INCREASE, ADD, SUBTRACT would more likely lead to SUBTRACT as the re- 
jected word. 

Secondly, subjects were likely to accept ambiguous words that con- 
formed to their previous habits. For example, in problems like PRAYER, 
TEMPLE, CATHEDRAL, SKYSCRAPER, Subjects with strong religious interests 
were more likely to accept the word PRAYER and reject SKYSCRAPER than 
subjects with little religious interest. Thus it seemed clear to Judson and 
Cofer that the direction chosen by the subject was the result of activation 
of mediating responses which recoded the material in a particular way. The 
Probability of a particular mediation response is determined in turn by its 
relative associative strength. This point will become quite clear in later 
examples. 


Fixation in Problem Solving 


Fixation by mediating responses of function. Ifa particular problem 
elicits a strong set of mediating responses which point to a particular solu- 
tion, and if that solution is incorrect, we have a case of fixation without ade- 
quate solution of the problem. The Subject becomes fixated on the wrong 
response and cannot switch to the right one. Frequently problems are pre- 
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sented to us in such a way that they do elicit the Wrong mediating responses, 
and great difficulty with the problem results. 

This point is illustrated by some of Duncker’s (1945) well-known ob- 
servations on problem solving. Duncker gave students a Variety of simple 
Problems to solve. One of them was as follows: The subjects were required 
to mount three small candles on a door (ostensibly for experiments on 
Vision). The materials the subjects needed for solution were scattered in 
Confusion on a table and mixed in with a number of other objects. The 
Crucial items were tacks and some cardboard boxes similar to small match 
boxes. The solution required the subjects to tack the boxes to the door, and 
then to melt wax on the boxes in order to hold the candles. 

This problem was presented to the subjects in two ways. The difference 
Was small but of critical importance. In one condition the boxes were filled 
With the experimental material—tacks were in one box, candles in another, 
and the matches in the third. In the other condition the tacks, candles, and 
matches were placed loose on the table and the boxes were empty. 

What difference did these conditions make? Those subjects who were 
Presented the problem with the boxes empty all solved it, while only about 
half the subjects were successful when the boxes were filled. What was 
Tesponsible for this difference? It is very easy to understand if we consider 
the possible mediating responses that these two versions of the problem 
elicited, When the boxes were filled, they were more likely to elicit a medi- 
ating response such as “container.” When the boxes were empty, the mediat- 
ing response “container” was likely to occur also, but not nearly so strongly. 
The result was that other mediating responses, among them perhaps “plat- 
form,” were able to occur. The filled boxes thus produced a fixated function 
Which interfered with the proper solution of the problem. 

Duncker studied a number of different problems like the one we have 
Just examined. His experiments were carried out with a good deal of in- 
Senuity, but, unfortunately, they leave much to be desired, since the number 
of Subjects was very small and the experiments were performed under 
Tather informal conditions. Other experimenters who have worked with 
these or similar problems have achieved much the same results, however, 
$0 this functional fixation, as Duncker called it, seems to be a genuine 
effect (see Adamson, 1952). | ) 

f One important point comes out of subsequent experimental Work, Tf time 
'S allowed between the exposure which sets off incorrect fixation and the 
Actual work on the problem, the fixation disappears (Adamson and Taylor, 
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1954). Thus, it is clear that whatever kind of verbal mediating habits are 
responsible for the fixation, they behave much as other kinds of verbal 
habits. 

Other experimental work has indicated the effect of verbal responses 
suggested to the subject upon the direction problem solving takes (Maier, 
1930, 1931). In one experiment, merely teaching the subjects lists of 
words which could arouse associations pertinent to solution increased the 
frequency of relevant solutions (Judson, Cofer, and Gelfand, 1956). Cotfer 
points out that such activation of appropriate solutions probably occurs 
through mediated generalization (Cofer and Foley, 1942). For example, 
if the word “rope” is associated with “swing,” subjects are more likely to 
solve a problem which involves getting two strings tied together by swing- 
ing one to the other by means of a pendulum bob. If, on the other hand, 
“rope” is associated with “hemp” and “pendulum” with “clock,” the pen- 
dulum swinging solution is less likely to occur. 

Riddles, puzzles, and similar brain teasers are based on the probability 
that a particular way of stating a problem will elicit a very strong associa- 
tion that prevents the appearance of a mediating response leading to the 
correct solution. This same sort of thing often makes students complain 
about trick questions on examinations. 

Fixation by method. Jn the examples we have discussed thus far, direc- 
tion of solution has been seen to be determined by verbal mediating proc- 
esses either set off by the problem situation itself or suggested by verbal 
associations given to the subject. Fixation, however, can occur by other 
means as well. Probably one of the most important is the pattern of rein- 
forcement for particular kinds of solutions. This kind of fixation is illus- 
trated by some experiments by Luchins (1942) on a well-known kind of 
problem. 

Luchins asked a group of subjects to solve a number of problems all of 
the same type. Here is an example: Given a 3-quart jar, a 21-quart jar, 
and a 127-quart jar, how would one measure out exactly 100 quarts? The 
solution is to subtract 21 quarts from the 127 quarts, which makes 106 
quarts, and then to subtract 3 quarts twice. The first problems given to the 
subjects always required the use of all three measures for an efficient solu- 
tion. After the subjects had solved six such problems, others were intro- 
duced which could be more efficiently solved with only two of the measures, 
though all three measures could be used. Not surprisingly, perhaps, the sub- 
jects kept on working through these simpler problems by the more compli- 
cated method. Thus the habit built up by the earlier problems persisted, 
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even though it no longer provided the best method. When the subjects were 
instructed to write “don't be blind” on their papers immediately after the 
sixth problem, a much smaller number persisted in the more difficult 
method. 

Such persistence of habits no longer most efficient for solving particular 
problems has been characterized as rigidity, and a number of investigators 
have experimental evidence to indicate that this kind of rigidity is modified 
by the scheduling of problems and hence of the habits that are reinforced. 
For example, on a conceptual card-sorting task, Schroder and Rotter 
(1952) demonstrated that a training series requiring frequent shifts in set 
resulted in much greater flexibility than one that required no shifts. 

Maltzman and Morrissett (1952; 1953) performed a series of experi- 
ments in which they produced various fixations on the solution of anagram 
(scrambled words) problems. They pointed out that the effects produced 
by fixation are easily interpreted as the result of simple habits. As a matter 
of fact, they were able to derive and verify certain predictions about fixa- 
tion from the assumption that the strength of the habits underlying fixation 
is a simple negatively accelerated (exponential) function of the number 
Of previous elicitations of that habit. Remember from the discussion in 
Chapter 2 that such a simple negatively accelerated curve is the most basic 
and elementary kind of learning curve. Thus, there is every reason to be- 
lieve that the habits responsible for fixation are determined by all the basic 
Principles of learning. 

Hypotheses in human problem solving are determined by the arousal 
Of verbal mediating responses, and these are aroused by and arouse in turn 
simple associations. These associations may or may not lead to the correct 
Solution of the problem, and to a certain extent the individual will try to 
direct or restrict associations to areas which have some probability of lead- 
ing to a solution. 

The habits that are aroused by the verbal mediating processes vary in 
Strength. The stronger habits are those that have occurred more frequently 
in the individual's past history, and these have a greater probability of oc- 
curring in any problem situation than weaker habits. Furthermore, they 
depend upon the stimuli with which they have been associated in the past. 
For example, a pendulum bob has a very low probability of eliciting re- 
Sponses appropriate to a hammer (using the bob to drive a nail into the 
wall). Yet, when the appropriate conditions occur, subjects can vary their 
mediated responses to the bob in such a way that it occurs to them to use 
it as a hammer (Duncker, 1945). 
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When a secondary or indirect use of a familiar object occurs, it is prob- 
ably because of either direct primary-stimulus generalization (which would 
be greater, say, if the pendulum bob looked like a hammer head or be- 
cause of mediated generalization. Mediated generalization occurs along 
some learned, or conditioned, aspect of stimuli and is characteristic of the 
relationships between words in a language (Cofer and Foley, 1942). Thus, 
the word “vane” might give rise to its homonym “vain,” which in turn might 
arouse “fruitless” and this could lead to “barren” and then to “dull” or 
“stupid.” By an elaborate intertwining of learned and unlearned generaliza- 
tion between words (considering words as stimuli rather than items of be- 
havior), a great flexibility is added to human behavior. This must be of 
fundamental importance in human problem solving. 

Furthermore, many factors which are not primarily associative determine 
the likelihood of particular solutions or outcomes in problem solving. These 
are so-called dynamic factors in behavior, such as motivation, attitude, and 
personality structure. There is a fairly large literature on the relationship 
between these factors and problem solving. For example, Thistlethwaite 
(1950) was able to demonstrate that students with particular biases con- 
cerning racial issues make more errors drawing inferences from statements 
with racial themes than from statements with neutral themes. The greatest 
number of errors in drawing such inferences does not necessarily occur 
with emotional items that are ambiguous, but rather with those in which 
the individual can see a possible answer compatible with his views. 

Since such attitudes and, to a certain extent, motives are determined by 
associative factors, simple associative factors have great importance in de- 


termining the nature and direction of the attempts people make to solve 
problems. 


Other Aspects of Problem Solving 


We have had nothing to say about two very fundamental aspects of prob- 
lem solving. What is the nature of understanding? How does an individual 
recognize and pick out the correct solution from a set of many available 
possibilities? The discussion of these questions, while basic to an under- 
standing of human thought and problem solving, would carry us away from 
the central issues in the psychology of learning. We mention them at this 
point to make clear that we do not solve all the problems of thought by 
discussing the habit structure of verbal mediating responses. Individual dif- 
ferences in ability to solve problems and to think creatively cannot be ade- 
quately described in terms of simple habits, but an understanding of the 
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role of habit, transfer, and generalization in problem solving gives us a 
grasp of certain fundamentals of the process. 


The Learning of Concepts 


The world presented through our senses is a vast jumbled confusion of 
different sensations. We are able to deal with it only by cutting it down to 
the size of our own mental processes. The primary way we do this is by 
setting up equivalences and identities among separate parts of our expe- 
rience. In short, we categorize and assign names to the categories. Thus, 
the deep maroon color of the book on my desk and the tomato-colored 
stripes in my tie both are characterized as “red.” This process of categoriz- 
ing so that all the infinite variety of the external world may be dealt with 
by our mental processes and language, is one of the most essential elements 
in human thinking. We call this process concept attainment, or concept for- 
mation. In the next few pages we shall examine some of the experimental 
literature on concept attainment and see how special problems in the or- 
Sanization of concepts are related to the basic principles of learning. 


Forming Concepts 

Categorization. Most of us are so familiar with the conceptual cate- 
Sorization of our world that we do not stop to think about the process of 
Arriving at categories. We may even naively believe that our conceptual 
Categories have independent existence. The habit of accepting the reality of 
Conceptual categories may be so firmly established that it will interfere with 
Scientific progress and thinking. Only in the past few years have systematic 
biologists generally come to the view that “species” of animals are not fixed 
Classes into which animals do or do not belong, but rather that “species” 
Are simply convenient classes into which we can group animals with similar 
Characteristics. Nature is often continuous and will not fit the boundaries 
Of our categories. Thus, frequently there is a continuous gradation in essen- 
tial characteristics from one species of animals to another. 

Every college student at one time or another has run into a homely ex- 
Ample of the conflict between categories and the continuity of things. Regis- 
trars and deans demand that instructors categorize students in classes usu- 
ally labeled A, B, C, D, and F. There may be only the faintest difference 
between the student with the highest B and the lowest A, but because hu- 
Man Society and thinking cannot function with continuously graded events, 
And must resort to categorization, We are required to place these students 


if Ae 
1 different classes. 
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We form classes by isolating attributes of things which are to be classed as 
identical or equivalent. There are two ways of doing this. We may take a 
heterogeneous collection of things and examine them for common attributes 
so that we can sort them into classes, or we may come to examine a number 
of things with some preconceived notion about the classes into which they 
ought to be placed. In either case, we assign names to the classes (thus 
recoding our experience). 

Our categorization is usually done in accordance with social reinforce- 
ment. Thus, children learn at an early age to discriminate between dogs, 
cats, and rabbits, though they may make mistakes during the early stage of 
learning to assign the proper names to the individual animals. 

Learning conceptual categories. Osgood (1953) reminds us that the. 
formation of concepts involves verbal mediating responses which permit 
us to recode the continuity of our experience. The classical experimental 
literature on the formation of concepts has dealt with the ways in which 
such verbal mediating responses are established. 

For example, Hull (1920) studied the way in which students learned to 
find identity relationships between stimuli Possessing common elements. 
He presented subjects with Chinese characters paired in a certain way with 
English words. Chinese characters are compounded of certain elements, 
called radicals. These may vary in position or size within the character, but 
Whenever they are present it means that the character has something in 
common with other characters which also contain the same radical. They 
are roughly analogous to certain syllables (mainly suffixes) in English. 
Thus, the words “repeat” and “return” have in common a syllable (actually, 
a bounded morpheme) which carries the connotation of coming back over 
the same event. In Hull's experiment, whenever a certain radical occurred it 
was always combined with a certain English word. The problem for the sub- 
jects was to recognize and associate the radical with the appropriate Eng- 
lish word. 

As might be supposed, such learning is much the same as any other kind 
of paired-associate learning. There were, however, some cases in which 
subjects obviously had learned to associate the proper English word with 
the proper radical, but these subjects could not recognize the radical. Thus 
they were reacting to something they could not verbalize. They had learned 
the correct pairing of the Chinese characters with English words, but they 
had not acquired a mediated recognition of this. 

Other experimenters (Heidbreder, 1924; Smoke, 1932) have studied 
the role of hypothesis formation in the discovery of concepts. Smoke (1932) 
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Teports that his subjects systematically tested and rejected a number of 
hypotheses—thus making use of transfer—until they finally came upon 
one which gave them enough correct responses to be satisfying. Heidbreder 
(1924) emphasized the fact that the learner does not always have to be 
actively engaged in hypothesis formation in order to arrive at a solution 
of the problem. In especially difficult problems some subjects engaged in 
What Heidbreder called “spectator” behavior; these subjects would resign 
themselves to responses which were more or less random and would spend 
their time in passive observation of the material presented to them. Under 
these conditions, subjects eventually solved the problems. Thus a period 
of quiescent observation may be just as valuable as a period of active 
hypothesis formation. 

The attainment of specific concepts. Despite the fact that concept at- 
tainment does not differ qualitatively from other kinds of learning there are 
differences in the ease with which various concepts are learned. 

Heidbreder (1946a, 1946b, 1947) had subjects learn to attach labels to 
the concepts illustrated in Figure 55. From the pictures we can see that 
Some of the concepts were highly concrete (faces, buildings, etc.), and 
others were unfamiliar and abstract (“‘nonsense” forms, numbers, etc.). 
Heidbreder found that there was a regular order in the attainment of con- 
cepts. The concrete-object concepts were the easiest to learn, the nonsense 
form concepts of intermediate difficulty, and the abstract number concepts 
most difficult. Heidbreder attributed this to a special factor she called the 
hierarchy of dominance, which is due to the objectlike character cf per- 
ception. Thus, the concept of “face” is easy to attain, since the instances 
Of it are perceptual entities—in a word, faces. Numbers are difficult to 
attain because the instances of them are embodied in concrete objects 
rather than being concrete objects themselves. Thus, the concept of “two” 
Might be exemplified by two spoons or two rabbits. 

A subsequent series of experiments (Dattman and Israel, 1951) did not 
Confirm Heidbreder because different ways of embodying concepts were 
used. When appropriate instances of numbers or abstract figures are used, 
the order of attainment of concepts disappears. This finding makes it quite 
evident that the ease of attainment of concepts most certainly depends upon 
the stimuli and the relationships between stimuli used to exemplify the 
Concepts. 

It turns out that there is another way to describe the differences in ease 
With which concepts may be attained, and this may be more fundamental 
than Heidbreder’s. An experiment by Baum (1954) clearly shows that the 
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Figure 55. Pictures used in Heidbreder’s experiments on concept formation. Each illustrates a 
concept which is named by a nonsense word. The concept MANKS (sixes), for example, is illustrated 
by the six stars or the six wavy lines. (Heidbreder, 1946b.) 


relative ease with which concepts like Heidbreder’s are attained depends 
upon how much interference there is from stimulus generalization produced 
by similarity of items belonging to different concepts. Baum was able to 
account for all of the differences in ease of attainment of concepts found by 
Heidbreder by the amount of confusion between guesses. For example, sub- 
Jects in Baum’s experiment had greatest difficulty attaining the concept of 
“sixness.” They would frequently give the response appropriate to “three- 
ness” when faced with the concept of “‘sixness.” Thus, there was generaliza- 
tion between the stimuli appropriate to these concepts. We can see, from the 
illustration showing Heidbreder’s stimuli, that six could be easily confused 
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with three, since frequently the stimulus embodying the concept of “six- 
ness” was given in two rows of three Objects each. Whereas the subjects 
found the concept of “faceness” quite easy; they would only confuse it with 
the concept of hat. Thus stimulus generalization (or, more accurately in 
Some instances, mediated generalization) produces confusion between the 
examples themselves, which leads to slower learning of some concepts. 

Again, as we might expect, since concept attainment seems to Obey all 
the rules for other kinds of learning, the rate of attainment of a concept is 
determined by discriminability among the instances of different concepts. 
Therefore, argues Baum, it is not necessary to say that an abstract concept 
is more difficult simply because it has less perceptual thing-quality about it 
(as Heidbreder seems to imply), but rather because an abstract concept is 
Senerally embodied in specific instances that are less easy to discriminate 
from those of other concepts. Thus a child has less difficulty learning the 
Concept of “tree” than that of “weed,” because trees are more discriminable 
from other growing things. 

It is difficult to say whether or not one could ever find a situation which 
clearly made a difference between Baum’s reasons for the variations in 
ease of attainment of concepts and Heidbreder’s, for nearly always, con- 
cepts which are easily seen as intact perceptual objects are easier to dis- 
criminate than concepts which are not concrete things. Indeed, it may be 
that Baum and Heidbreder are putting forward much the same idea. Baum’s 
Version seems to be the more general one, however, and it relies on ideas 
derived from more elementary problems in learning, whereas Heidbreder’s 
Notion is a specific hypothesis advanced to account only for the data on con- 
Cept attainment experiments. 

Positive and negative instances. In learning about concepts we may 
Profit from two kinds of examples, positive and negative. Positive examples 
Are instances of the concept and include the essential characteristics. Nega- 
tive examples are instances of what the concept is not and may lack one or 
More of the necessary characteristics. Thus a physician may diagnose a 
Particular disease if a high fever, spots, and an erratic pulse are present. 

> however, only the high fever and erratic pulse are present and not the 
Spots, it is not an example of this particular disease. 

In some early experiments by Smoke (1932) the results suggested that 
Negative examples did not materially contribute to the learning of the con- 
cept. In other words, subjects learn by finding out what a concept is, not by 
What it is not. Hovland (1952), however, pointed out that there is generally 
less information in negative than in positive examples, and it is not clear 
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whether the differences obtained by Smoke are because of the smaller 
amount of information or because people just have difficulty in assimilating 
negative examples. Therefore, Hovland and Weiss (1953) performed a 
series of experiments in which the amount of information in positive and 
negative instances was controlled. It turned out that subjects do better with 
positive examples even under these conditions, though some subjects could 
learn entirely on the basis of negative examples. 

‘There are many reasons why positive rather than negative examples lead 
more readily to concept attainment. For one thing, the required characteris- 
tics are directly presented, whereas with negative examples, if the subject 
is to learn what the concept is, he must put together all of the things a con- 
cept is not. 

In everyday life, concepts may be almost impossible to establish by nega- 
tive instances. This is because the ways in which things differ are 50 enor- 
mous that a single negative instance scarcely reduces uncertainty at all. Try 
to describe a simple object such as a book by mentioning all of the things it 
is not. It is possible in the experiments by Hovland and Weiss in which 
negative instances were designed to tell as much as positive instances, that 
the subjects did not do as well with the negative ones simply because they 
were not used to dealing with concepts in this manner. A machine could 
be constructed which would solve the Hovland and Weiss problems just as 
easily either way. It is clear, however, that human beings have great diffi- 
culty with negative examples even when the information presented is con- 
siderable The fact is, though, that when forced to it people can arrive at 
concepts through negative instances. 

Furthermore, as Bruner, Goodnow, and Austin (1956) point out, many 
concepts must be arrived at largely through negative instances. This is 
characteristic of the attainment of disjunctive concepts. Disjunctive con- 
cepts are those in which one or another attribute may be used to identify 
the concept. These authors give as an example the difficulty in determining 
substances responsible for an allergic reaction in an individual. A food 
allergist must start by finding out what his patient is not allergic to and 
from there try to arrive at a class of allergy-producing substances by many 
negative examples and perhaps a few positive ones. 


Strategies in Concept Formation 


People do not arrive at hypotheses in problem solving by random selec- 
tion. Rather they employ strategies, though they need not necessarily be 
aware of committing themselves to particular plans. Strategies are arrived 
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at by deciding which class of responses one wishes to try in solving a prob- 
lem or attaining a concept and by deciding in what order these responses 
are to occur. 

For example, I forget my keys one morning, and I must borrow a set 
from the janitor to get into my office. The keys he gives me are all alike, so 
I must decide how to find the one that fits. One strategy would be to try 
the keys in order. Another would be to examine the keys for a clue, such as 
code numbers. If I am harried and out of sorts, I may simply jam the keys 
into the lock more or less at random. The last is a strategy too. The prob- 
lem of strategy is an important one, because we know that there must be 
a best approach for every problem. Some investigators have designed prob- 
lems in such a way that they know the best strategy in advance, and these 
experimenters have tried to find out if subjects use the best strategy, or if 
not, which one they do use. 

Bruner, Goodnow, and Austin (1956) have made use of some special 
techniques for studying strategies in concept attainment. In a typical experi- 
ment, a subject would be presented with a set of cards. These cards varied 
in four ways: (1) they had different figures on them; (2) they were of dif- 
ferent colors; (3) there were different numbers of figures on different 
cards; (4) they had different numbers of borders around them. Each of 
these “attributes” had three different values; thus there were three kinds of 
figures, crosses, circles, and squares. All 81 possible cards in this set are 
illustrated in Figure 56. 

The experimenter would form a concept built on selected “values” from 
these attributes. For example, the concept might be green circles with two 
borders. Thus all cards with both green circles and two borders (conjunc- 
tive concept) would be correct. The subject's task was to find out what 
the concept was by choosing a card and asking the experimenter whether 
this was an example of the concept or not. What strategies do people fol- 
low in choosing the cards? 

Kinds of strategies. First of all, there is a logically best strategy. In the 
Problem we outlined there are 255 possible concepts. A first positive card 
dliminates 240 of these, and there is a similar value for all the positive and 
Negative instances thereafter. A logical computing machine could keep 
track of all of the possibilities logically eliminated and solve the problem 
in the minimal number of trials. A little reflection will make it clear, how- 
ever, that such a strategy places an almost impossible burden on the human 
Memory and the capacity for assimilating information already obtained. 
Thus it is very unlikely that anyone would use this completely logical ap- 
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Figure 56. The instances used by Bruner, Goodnow, and 
varied in (1) shape, (2) color, and (3) number of borders. There 


(e.g., color was either green, red, or black). (Bruner, 
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proach. Some subjects, however, do make a reasonable approximation to 
the logical strategy.3 

Another strategy the investigators called conservative focusing, and it 
consists of choosing the test cards so that a change in just one attribute is 
made after the subject happens onto a positive instance. For example, sup- 
pose the subject chooses a card with two green circles and two borders. 
He might then decide to concentrate on the attribute of shape, and for his 
next card choose another shape to test (for example, two green crosses with 
two borders). Or finally, the subject might take a gamble and change two 
or more attributes at the same time. This the investigators call focus gam- 
bling; it is gambling because, while the subject may solve the problem in 
fewer trials than with conservative focusing, he may also take longer. 

The use of strategies. In general, as we might suppose, subjects do 
better with conservative focusing than with logical strategy (or an approach 
to it). The superiority of the focusing strategy becomes greater when the 
Strain placed on memory and assimilation becomes greater. People who try 
to approximate the logical strategy do more poorly as conditions become 
more difficult. If the subjects are limited in the number of choices they are 
Permitted to make (imposed by telling the subject that he can have only 
four or perhaps even one choice to try to solve the problem), the subjects 
tend to shift over to focus gambling. Finally, subjects change their strategy 
as they move from problem to problem. Subjects who have had many posi- 
tive examples in earlier problems gradually shift to a focus gambling strat- 
egy, while those subjects who have had many negative examples shift more 
toward the conservative, one-change strategy. Thus, concept attainment 
Strategies are altered by the subject’s subjective estimates of changes in the 
Probability of payoff. 

Thus, even such an extended and general characteristic of behavior as 
the use of strategy in problem solving and concept attainment is determined 
by the pattern of previous reinforcement. Also individual habits, less di- 
tected by payoff than by the verbal mediational activity characteristic to 
one’s life and work, influence strategy. Bruner, Goodnow, and Austin point 
Out that a few of their subjects of very superior intelligence and usually with 
Some mathematical training tried to attain the concepts by a purely logical 
Strategy. This was an error in the sense that it failed to meet the objectives 
Of a workable strategy. Thus the mediational processes that directed these 


8 An experiment by Whitfield (1951) shows that the same limitations of assimilation 
and memory prevent human beings from being perfectly efficient in the solution of 


trial-and-error problems. 
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people to attempt a purely logical strategy failed to produce adaptive be- 
havior. 


Learning and Thinking 


In conclusion, we can point out that the intimate relationship between 
learning and thinking or problem solving does not mean that efficient think- 
ing is entirely determined by associative mechanisms. It is, however, cer- 
tainly determined to a considerable extent by associative learning, and 
furthermore the limitations of associative learning provide very real and 
important limits to our ability to solve problems. We cannot behave like 
logical computing machines, because we do not have the ability to scan 
our memories almost instantaneously and pick out and act upon the appro- 
priate stored information. We can carry in our heads a limited amount of 
information, and we are dependent upon external cues and the situations 
of the moment for our ability to extract and act upon any piece of informa- 
tion in our memory. Furthermore, from the logical point of view our storage 
systems are haphazard and inefficient. We recode things in a hit-or-miss fash- 
ion; we may rely on a nonsense rhyme, for instance, to help us remember 
the order of the cranial nerves for an examination in comparative anatomy. 

We trade order and magnitude for flexibility, however. A computing 
machine can operate on only one set of instructions. It is saddled with the 
strategy set into it by its operators. Its human operators, on the other hand, 
can move here and there. They can make guesses and govern the reason- 
ableness of their guesses by an intuitive and inefficient, but effective, calcu- 
lation of subjective probabilities of payoff. These guesses and the strategies 
for dealing with them are learned, and the ability of human beings to make 
effective guesses in working out problems is determined by the pattern of 
past associative learning. 

At the same time we are at the mercy of our prejudices and beliefs. A 
large experimental literature is devoted to the demonstration that critical 
thinking and logical reasoning among individuals often turns out to be a 
reflection of prejudice and earnestness of belief. 

These dynamic factors, as we have seen, are an important part of asso- 
ciative learning. Motivation influences behavior, and it is behavior, overt or 
implicit, that we learn. Therefore, we learn what our motives dictate that we 
learn. In part, this means that our thinking processes are indirectly de- 
termined by the influence of dynamic factors on learning, but it also means 


that these processes are directly determined by the same factors that de- 
termine learning and memory. 


CHAPTER 12 


THEORY AND 
APPLICATIONS OF LEARNING 


Although we have examined many specific theoretical problems in the 
Psychology of learning, we have not yet taken the opportunity to look at 
some of the more general theoretical questions. The purpose of this chapter 
is to Survey some of these broader aspects of learning. 

In the course of examining particular topics such as partial reinforce- 
ment, we have seen some of the hypotheses that have come out of various 
theories of learning. In this chapter, rather than concentrate on these 
limited hypotheses, we shall look at some of the contrasting strategies of 
theories of learning and how these came to be. In addition, we shall want 
to consider the relationships of the study of learning to other biolog- 
ical and social disciplines. For example, the neurophysiological basis of 
learning is one of the most fundamental problems in physiological psychol- 
0gy, and we could not consider our survey of learning complete without at 
least a glance at current neurophysiological theorizing. Another important 
question concerns the relations between animal behavior in nature and the 
facts and theories of learning. We should be neglecting something funda- 
mental if we did not briefly survey some of the current work on this topic. 
On the other side of psychology, we have yet to consider the psychology 
of learning in relation to personality theory, social behavior, and the educa- 
tive process. These are topics we shall need to cover. 

This chapter, then, will include a summary of the theoretical background 
Of the Psychology of learning and of the relationships between the study of 
learning and other biological and social disciplines. In the first section we 
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shall consider learning theory and after that, turn to a survey of the broader 
implications of the psychology of learning. u 


LEARNING THEORY 


Psychology generally, and particularly the psychology of learning, has 
long been noted for the exotic variety of its theories and the bitter argu- 
ments of their exponents. For several reasons, however, we shall not ex- 
amine each of the classical and current theories of learning. First of all, 
this task has already been done in a very complete way by Hilgard (1956), 
and secondly, the grand, universal theoretical controversies are somewhat 
out of date. Not too many years ago, a psychologist interested in learning 
was identified by his adherence to one current theory or another, and these 
theories were “opposed” to one another from the ground up. Theoreticians 
(and experimentalists) spent much time in fruitless argument with one 
another, either by damning the logic of an opposing theory or by attempting 
to demolish it through the performance of “crucial” experiments. 

Such theoretical argument has not entirely disappeared today for the 
good reason that hardly any of the basic issues have been settled to every- 
one’s satisfaction. The strong emphasis upon large scale theories of be- 
havior and devoted adherence to one theory or another have disappeared. 
Currently, psychologists in learning are more likely to quarrel about meth- 
ods ‘of approach and about areas fruitful for research than about universal 
theories. Since it is recognized that there is no theory of learning currently 
available which is able to “explain” all the facts of learning, there is little 
talk about the exclusive and ultimate correctness of one or another theory. 

One of the things likely to arouse an argument among psychologists is 
the question of the relative merits of trying to derive from rigorously quan- 
titative deductions limited hypotheses about highly specialized problems or 
trying ‘to sketch in a rather intuitive fashion a qualitative description of 
fundamental problems and those of general interest. Thus, one group of 
Psychologists may be interested in rigorously deriving a mathematical theory 
of the effects of partial reinforcement on the behavior of the rat in the 
runway, and another group may be interested in trying to find the main 
elements in the relationship between associative learning and problem 
solving in intelligent human adults. There is really little to argue about be- 
tween such groups, except the relative merits of the problems they have 


chosen to work on, since the problems themselves will only at some future 
date come to a common ground. 
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In this section we shall examine a few of the approaches that are currently 
important in the theory of learning. This will not be an exhaustive account, 
since we have space only to discuss a limited number of problems and 
methods that are of interest to theorists at present. 


The Quantitative Approach 


The Hullian School 


In the background of the main currents of attempts to derive theoretical 
Principles of learning by mathematical techniques stands the work of the 
late C. L. Hull. His chief efforts in the theory of learning are only in a very 
primitive sense mathematical, but he, more than any other worker, alerted 
Psychologists to the possibilities of applying mathematics to quantitative 
problems in learning. 

Hull's approach. Hull’s method! was to make a number of postulates 
and see what theorems could be derived from combinations of these. Postu- 
lates are fundamental statements not directly testable by experimental 
means. Theorems, which are derived by rigorous logic from the postulates, 
are experimentally testable, however. Thus, one of Hull's postulates con- 
Cems a stimulus trage, or afterdischarge, as the result of stimulation. Such 
a trace cannot be directly measured. Its properties in conjunction with other 
postulated properties lead to predictions about the form of the relationship 
between ease of conditioning and time intervals between stimuli. This rela- 
tionship should be testable. 

Though this approach provided the basic framework for Hull's work, in 
actual practice his theorizing fell considerably short of his aims (Koch, 
1954). Very often, he slipped into attempts to demonstrate the adequacy 
of his postulates by citing experimental evidence to show how “reasonable” 
they were. His deductions were sometimes contradictory and in logical 
error. Furthermore, his theory seemed more quantitative than it really was. 
This statement deserves some further comment. 

Hull's quantitative theorizing consisted mostly of putting his basic postu- 
lates in the form of mathematical equations. He did little in the way of 
deriving theorems by mathematical means from these postulates. Ordinarily 
it is the mathematical derivation of theorems from postulates that people 
mean when they talk about mathematical theories. Hull simply states an 
equation and lets it go at that. Since there are almost limitless possibilities 


1 The best statement of Hull's beliefs about the method and philosophy of science is 
in his Principles of Behavior (1943). 
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for the choice of equations for each situation, his efforts in this direction 
appear arbitrary to say the least. 

An even greater difficulty with Hull's quantitative theorizing is the fact 
that his basic equations seem to lead nowhere. Often, psychologists state 
relationships in a quasi-mathematical form. Thus we may have heard “ten- 
sion is directly proportional to the amount of conflict.” Such a statement 
implies that the relationship between tension and conflict can be written 

T=KkC 
Now there is nothing wrong with this statement and the equation except 
that we have no way to measure either tension or conflict in such a way that 
we can confidently test an apparent linear relationship between them. There 
is just no way to measure quantitatively either amount of tension or amount 
of conflict. 

This inability to measure many of the concepts Hull talked about was 
part of his difficulty. His most important hypothesis about learning is that 
of habit strength, which is an intervening variable not directly measurable. 
In order to make a quantitative statement about habit strength that can be 
tested experimentally, it must be related to direct measures of behavior 
such as the amplitude of a galvanic skin response. Since there are an enor- 
mous number of measures of behavior that can be used to evaluate learning, 
there must be an equally large number of arbitrary “postulates” needed to 
relate habit strength to behavior. Hull made valiant efforts to solve this 
problem (Hull, et al., 1947), but his attempts were so conspicuously un- 
successful that they reflected on the adequacy of his whole theory. 

Hull’s method provided two important lessons for subsequent quantita- 
tive theorists. (1) Given the present confusion over basic problems of 
measurement and fact in psychology, a rigorous postulational approach can 
work only by making each theory cover a very limited area. A compre 
hensive deductive theory of behavior is practically impossible at present. 
(2) The mathematical relationships in the theory must be developed from 2 
very few definitions and postulates. The proliferation of arbitrary mathe- 
matical equations simply leads to confusion, both in the theory and in the 
experimental test of the theory. 

Hull's contribution. Although we have been critical of Hull's method, 
it should not be assumed that all his work was futile. In the first place, he 
did introduce learning psychologists to quantitative theory. Secondly, and 
more importantly, Hull fixed for many years the basic problems and con- 
cepts of learning theory. Hull made a fruitful union of mathematics, classical 
association theory (see page 309), and motivation theory—and out of this 
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came our basic contemporary concepts of drive, stimulus generalization, 
secondary drive, etc. Thus, to a considerable extent we owe the basic con- 
ceptual framework of the psychology of learning to Hull. Workers in mo- 
tivation theory, in the physiology of motivation and learning, and in per- 
sonality theory—all use concepts which were either introduced or made 
popular by him. 

Not the least of Hull’s contributions was in forcing other theorists to 
come to grips with him. The long-standing controversy between Hull and 
his associates and Tolman and his associates led up many a blind alley, but 
out of it came important knowledge. This controversy contributed to the 
clearer understanding of the separation of motivational and learning vari- 
ables and to the understanding of the role of reinforcement in controlling 
behavior. Therefore, in a very real sense, Hull stands at the origin of much 
of modern behavior theory. 

Work in the tradition of Hull is very much alive. K. W. Spence has at- 
tempted to correct many basic defects in Hull’s theory. He has severely 
limited the applications of his fundamental postulates to very special prob- 
lems in behavior. He has, at the same time, attempted to simplify Hull's 
basic theory and make it more amenable to ordinary mathematical treat- 
ment. In a recent book (Spence, 1956) he has presented his own theory of 
learning, and this probably stands as the final and culminating work of the 
Hullian school. 


Statistical Learning Theory 

The behavior of animals and men is variable. Learning curves are irregu- 
lar. Most earlier learning theorists regarded this variability as a nuisance 
Which could be minimized by careful experimental control and by studying 
behavior only in the most circumscribed situations. There exists, however, 
a mathematics of variability, or randomness, and it was inevitable that 
Sooner or later psychologists would apply some of the mathematics of ran- 
domness to the study of behavior. Hence, the statistical theories of learning. 

The basic notion behind any statistical theory is that the events one 
measures in the laboratory are the result of the influence of a whole popu- 
lation of elementary random events. These are events which cannot be pre- 
dicted; a good example is the unbiased toss of a coin. If we toss a penny 
enough times it will come out heads about 50 per cent of the time and tails 
about 50 per cent of the time. We can think of the percentage of heads as 
the “result” measured in the laboratory and each individual toss of the coin 
as one of the elementary events that goes to make up the result. In statistical 
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learning theory, postulates are stated about the distribution of elementary 
random events, and measurable results are predicted from a mathematical 
development of these postulates. 

In recent years a number of theorists have developed statistical learning 
theories. The theories of Bush and Mosteller (1955), Estes and Burke 
(1953), and Restie (1955) are all rather similar to one another in method 
and outcome. Estes and Burke (1953) have called theirs a theory of “‘stim- 
ulus variability.” The essential notion they use to derive the basic curves for 
classical and instrumental conditioning is that each time a response occurs, 
it becomes associated with a sample of all of the possible stimuli that are 
likely to impinge on the organism on successive trials. We can think of the 
sample as a random sample of stimulus elements that manage to get through 
the sensory input of the organism at the time a particular response occurs. 

Notice that in this theory, learning is an all-or-nothing affair. The sample 
of stimulus elements that occurs simultaneously with any given response 
becomes conditioned to that response. The reason, according to the theory, 
that learning curves in practice are gradual and irregular is that different 
samples occur on successive trials. These samples will overlap to some ex- 
tent, so after an initial trial some of the stimulus elements will already be 
conditioned. Others will not, however, and these become conditioned on 
that trial. We can see that if the population of stimulus elements were a 
limited one, eventually so many elements would have been conditioned that 
there would be none left to condition. Thus, conditioning approaches a 
limit of 100 per cent. An intuitive idea of how the cumulation of learning 
occurs can be derived from Figure 57. 

The strategy of statistical learning theories. Statistical learning theorists 
have, to a considerable extent, avoided the problems which plagued Hull. 
Their postulates are for the most part elegantly simple. Bush and Mosteller 
(1955), for example, are able to state an equation which implies that the 
change in probability of a response will be determined by two parameters, 
one associated with reward and the other with punishment. The rest of the 
mathematical development of their theory is given over to the applications 
of their basic equation and the definition of events, such as trials, and 
measurements, such as latency. 

Furthermore, statistical learning theorists are not bothered by the cumber- 
some necessity of transforming equations which relate independent variables 
to intervening variables into equations which relate independent variables 
to dependent variables. The equations are all stated in such a form that 
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there is only one fundamental measure of behavior, the probability of occur- 
rence of a particular response. All other measures, such as magnitude of 
response and latency of response, are derived from definitions of probabil- 
ity of response. Therefore, the statistical learning theories have the consid- 
erable merit of suggesting a fundamental measurement in psychology, the 
measurement of probability or, empirically, the relative frequency of occur- 
rence. This has found widespread acceptance among theoretical psycholo- 
gists, for a variety of efforts in psychology have recently converged on the 
common notion that the basic measurement of behavior must be in terms of 
probability of occurrence of different responses. 


POPULATION OF STIMULUS ELEMENTS 


Figure 57. An illustration of two successive trials according to statistical learning theory. Each dot 
represents one of the “elements” of the learned act. The dots enclosed by trial 1 are present and 
hence conditioned on the first trial. The dots enclosed by trial 2 are present on the second trial. 
Notice that some of these dots are already conditioned, so that the second trial adds fewer ele- 


ments to the strength of conditioning. 


The limitations of statistical learning theory. The elegance and sophisti- 
cation of statistical learning theories should not blind us to certain defects. 
One concerns the kinds of behavior the proponents of these theories have 
chosen to study. To a considerable extent, these theorists have gone over the 
same ground covered earlier by the Hullian school. This has meant that 
they have not discovered much that is exciting or new. 

Strangely enough, where these theorists have ventured into new fields, 
there has been considerable dissatisfaction with the results. One field that 
has been invaded to quite an extent by the statistical learning theorists is 
the study of “guessing behavior” in human beings. The experiments on such 
behavior are familiar to us, since we reviewed them in the section on partial 
reinforcement. It is the fact that these problems grew up as analogies to the 
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effects of partial reinforcement in animal behavior that has led to the attempt 
to deal with them by statistical learning theories. 

In the guessing experiments, subjects face two lights mounted on a board. 
The subjects are to guess while one light is on whether the other one will 
come on also. The experiment is arranged so that this will occur a certain 
number of times, though the individual occurrences are nearly random. In 
this experiment, subjects are likely to guess, after much practice, about the 
same percentage as the actual occurrences of the light. This effect can be 
derived from statistical learning theory. Unfortunately, however, this same 
effect can be derived by a number of other more complicated, but perhaps 
more reasonable, theories. Furthermore, the effect does not always occur. 
It is not the rational strategy, and statistical learning theory has a most un- 
convincing way of handling the rational strategy. 

Some experiments have been designed specifically to test the implications 
of statistical learning theory for guessing behavior (Estes and Straughan, 
1954). These experiments are very limited in scope, however, and the test 
of the theory consists of curve fitting based on empirical constants, a tech- 
nique rather more likely to lead to success than failure. 

These guessing-game experiments are likely to strike the outsider as 
rather sterile and without significance to the basic problems of human be- 
havior. As models of human decision making they have a certain merit, but 
the statistical learning theories so patently ignore some basic facts of human 
behavior (the ability of human beings to remember several guesses previous 
to the one they have just made, for example) that the use of these experi- 
ments as tests of the theory seem to lead nowhere. The theory and the ex- 
periments are, therefore, much more convincingly applied to the behavior 
of rats. 

A somewhat different approach gives promise of some merit. By using 
many of the same notions as those used by the statistical learning theorists, 
Miller and McGill (1952) have been able to construct a small theoretical 
model of free recall of verbal items. This theory assumes that the probability 
of recalling a word on any trial is completely determined by the number of 
times the word has been recalled on previous trials. Such an assumption 
appears to be quite reasonable as a first approximation to a quantitative 
theory of verbal recall. Miller and McGill examine the implications of sev- 
eral levels of complexity of the theoretical model, and again by curve fitting, 
the theory seems to fit some data (it should be mentioned, however, that in 
this case the theory leads to a rather novel way of treating and presenting 
data). Unfortunately, this theory must introduce some assumptions that 
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are patently contrary to fact. For example, it must assume that the different 
words used all have independent probability of being recalled. 

In general, both the weakness and strength of statistical learning theory 
lies in the introduction of oversimplification. It is a strength because it sug- 
gests fundamental variables which need clear and explicit treatment before 
we can go on to the more complicated problems, and it is a weakness be- 
cause the behavior predicted by the theory is limited to special and exotic 
cases only remotely related to the behavior of real men and animals. 


Two General Approaches to the Psychology of Learning 


Association Theory 


If the human being is the featherless biped, he is even more character- 
istically the verbal animal. The major effort in trying to understand the 
nature of learning in man has been directed toward verbal learning. Experi- 
mental study of verbal learning has been enormously influenced by the tradi- 
tion of associationism, and in order to understand some of the current pre- 
occupations of psychologists interested in verbal learning, we need to look 
briefly at classical associationism and how it has influenced current work. 

Actually, associationism provides the deepest current in the psychology 
of learning, and nearly all theorizing has either been directly in the associa- 
tionistic tradition or influenced by it. Association theory has its beginnings 
in Aristotle, and it runs very strongly through the history of British philoso- 
phy from the seventeenth to the nineteenth century. The doctrine of the 
“association of ideas” was designed to provide a description of the conti- 
nuity of mental life. Ideas provide the units, or elements, of mental life, 
and they follow one another in a constant chain. One particular idea follows 
another one because, sometime in the past, these ideas have occurred to- - 
gether. Or one idea elicits another one because they are similar or con- 
trasting. 

Classical association psychology has disappeared, but the idea of the 
discrete association of elements of behavior by contiguity and by similarity 
(stimulus generalization) provides some of the fundamental elements of 
the modern psychology of learning. 

Relationships between learning, motivation, and incentive conditions in 
modern psychology have much of their origin in the amalgamation of asso- 
ciationism and hedonism put forward by the British philosophers. Locke, for 
example, reminds us that the ideas that make the deepest and most lasting 
impression are those accompanied by pleasure and the relief of pain. Thus 
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he says, “the great business of the senses being to make us take notice of 
what hurts or advantages the body, it is wisely ordered by nature that pain 
should accompany the reception of several ideas.” 2 

We have seen in earlier chapters that Ebbinghaus successfully translated 
the doctrine of association of ideas into experimental work on the higher 
mental processes. Since Ebbinghaus’s time there has been a continuous 
effort to expand and elaborate the experimental analysis of the associative 
processes. From Robinson (1932) to McGeoch (1942) to Underwood 
(1957), the experimental work on association has contributed to the scien- 
tific understanding of human nature. 

Why association theory? Why has so much attention been paid to asso- 
ciation theory by psychologists interested in verbal learning? Perhaps the 
most important reasons why associationism is an ancestor of so much of the 
contemporary psychology of learning while other philosophic doctrines 
have disappeared are (1) its emphasis upon cause and effect and the chain- 
ing of ideas and (2) its analytic nature. Since ideas and the words that 
accompany them can be separated one from another and because they can 
be combined in every imaginable Way, it is possible to treat arrangements 
of them as independent variables. Independent changing of the controlling 
variables is the cornerstone of the experimental method, and classical asso- 
ciation theory provided a framework for the experimental investigation of 
verbal behavior. 

“ Thus the hallmark of associationism in the modern study of verbal learn- 
ing is the independent variation of the stimuli or Words presented to the 
learner. We put nonsense syllables, etc., together in different ways and see 
What effect the different arrangements have upon the behavior of the learner. 
In so doing, we find justification for emphasis upon contiguity, repetition 
(the “law” of association added by Hartley), and similarity in determining 
how behavior will organize itself. With the additional consideration of fac- 
tors of motivation and incentive, a complete Program for the experimental 
investigation of verbal learning emerges. 


The limitations of the associationistic approach. The classical associa- 


tionistic approach to the study of verbal learning and memory is exemplified 


by the methods of rote serial learning. These methods are designed to exert 


the greatest control over the learner; they force him to associate items 


according to the design of the experimenter, not according to his own 
Wishes. The result is that the typical experiment on the verbal associative 


2 John Locke, “Concerning Human Understanding,” in E. A. Burtt, The English 
Philosophers from Bacon to Mill, Modern Library, Inc., New York, 1939. 
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Processes gives only a rigid and somewhat artificial model of the learning 
Processes as they are likely to occur under natural conditions. 

The modern approach to associative learning, however, has profited from 
the continuing criticism of dissenting psychologists and the use of new tech- 
niques. The result has been a growing flexibility in the study of associations. 

For example, information theory has provided a really fundamental tool 
for the expansion of the techniques for studying and understanding verbal 
associations. Information theory reminds us that associations are not all-or- 
nothing in character, but are probabilistic. Thus, in a free-association test, 
the stimulus word “needle” is likely to elicit the response “thread,” but it 
might instead elicit “pin” or “sharp.” The response “thread” has the high- 
est probability, but other words also have finite probabilities of being re- 
sponses to “needle.” Secondly, information theory reminds us that the 
associations exist in probabilistic chains, as we saw in earlier chapters. 
Finally, information theory has Provided a mathematical technique for the 
analysis of associations, and there is currently much interest in the elabora- 
tion of verbal learning theory through both informational mathematics and 
the mathematics of statistical learning theory. 

Associationistic techniques have been expanded and improved as the 
result of criticism and work by gestalt psychologists (see below), so that 
contemporary studies of associations tend to emphasize organization and 
context. Organization of behavior is treated analytically, however, and the 
notions and techniques of information theory, such as dependent probabili- 
ties and recoding, are used in the analysis of organization in verbal be- 
havior. The recognition of intraverbal mediating processes is an important 
step forward and, as we have seen in the past few chapters, has enormously 
increased the scope of associationistic studies. 

Contemporary workers in verbal learning tend to be rather empirical 
in their approach, but there are exceptions to the rule, and since statistical 
learning theory is primarily associationistic in character, we can expect an 
increased interest in the application of statistical learning theory to verbal 
learning. 


Gestalt Theory 


The one tradition in the Psychology of learning that is conspicuously 
out of the main current of associationism is gestalt psychology. Gestalt psy- 
chology is one of the classical grand systems in psychology, and we said a 
few pages back that we should not examine any of these comprehensive 
systems. Nevertheless there have been some important particulars in which 


312 THE PSYCHOLOGY OF LEARNING 


gestalt psychology has profoundly influenced experimental work in learning, 
and we should look at some of these. 

Gestalt psychology and the emphasis on complexity. Most learning 
theorists take the view that we should attempt to find the simplest theory 
to account for a set of phenomena. The gestalt psychologists have argued 
that this approach has led most learning theorists to oversimplify behavior. 
Gestalt psychologists have consistently looked for the complexities in be- 
havior. As we have seen, Kohler was not satisfied with Thorndike’s trial- 
and-error analysis of animal behavior and insisted upon a higher-order 
process of insight. Kohler was right in the sense that Thorndike ignored 
many important aspects of animal behavior, but modern learning theorists 
insist that the examples of insight suggested by Kohler are explainable by 
the action of elementary variables which had already been subjected to 
experimental analysis. 

This seems to be the fate of much of gestalt Psychology, at least in the 
field of learning. These psychologists point out some rather complicated 
and interesting problems in behavior which apparently cannot be accounted 
for by traditional associative mechanisms, and then, after experimental 
work, it is seen that these problems can be viewed as more complicated 
interactions of the fundamental variables of association. 

This seems to be for the very good reason that gestalt notions do not 
readily lend themselves to experimental analysis; they are derived from 
naturalistic observation of complicated behavior and suggest only limited 
means for analysis. The analysis comes from the associationistic tradition. 
Nevertheless, the Problems Propounded by the gestalt psychologists have 
forced associationistic investigators to take into account such things as medi- 
ating behavior, patterning in behavior, and assimilation. 

The influence of gestalt psychology on learning. 
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tively if it is recoded by various kinds of grouping. Some forms of grouping 
may lead to a mediated response of “understanding”; this probably means 
that the person doing the memorizing is able to find a mediated response 
to handle the numbers which belongs to a whole organized set of different 
activities. For example, if the above number is coded properly it can be 
thought of as one hundred and forty-six billion, etc. 

Katona devotes most of his monograph to a direct comparison of rote 
methods and more organized methods of learning and remembering how to 
solve various problems. He brings evidence to show that learning by under- 
standing produces transfer more readily than rote learning. The sum and 
substance of his work is that organization is essential to really effective and 
efficient learning. 

We have already examined a good deal of evidence on the effects of or- 
ganization on memory, and we have looked at some experiments which 
revealed some of the basic reasons why organization benefits learning. It is 
safe to say that, while the recent experimental work on organization and 
memory has not been overtly gestalt in its orientation, it has been much 
influenced by the pioneering work of the gestalt psychologists. 

Wertheimer’s (1945) book on thinking has likewise had the effect of 
stimulating many investigators to work systematically and analytically on 
the problems of organization and understanding in thinking. Thus, gestalt 
psychology has pointed the way to important problems and has suggested 
the really basic variables at work in some of the more highly organized 
mental processes. Nevertheless, the analytic work that has led to disentan- 
glement of the many factors involved in more organized learning and think- 
ing has been basically associationistic in orientation. 

We have already examined some of the work of Kohler that led to the 
notion of insight in problem solving, and we have seen how subsequent in- 
vestigators tried to establish a bridge between the kinds of observations 
made by Kohler and those made under more restricted conditions. In gen- 
eral, then, the influence of gestalt psychology on learning has been to point 
out difficult and intriguing problems in the higher mental processes and to 
suggest general schemata to explain their workings. 

This evaluation of the influence of gestalt psychology is tentative and 
temporary. It may well turn out that gestalt psychology may make funda- 
mental and sweeping contributions to the problem of the relationship 
between behavior and neurophysiology. If so, its influence on learning will 
be even greater and more fundamental. 
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LEARNING AND ITS BIOLOGICAL BACKGROUND 


The ability to learn provides the animal with its most flexible and imme- 
diate form of adaptation. Learning is, in a word, a fundamental biological 
phenomenon. We have not had much to say about the biological aspect of 
learning, since this is usually a subject for physiological and comparative 
psychology. We cannot, however, present a complete account of learning 
without some remarks about its biological background. 


Neurophysiological Theories of Learning 


The changes in behavior called learning are, in the last analysis, reflec- 
tions of changes which must occur within the central nervous system. It is 
not surprising that there has been so much experimental work on the rela- 
tionship between the nervous system and learning. This work, however, has 
not yet solved the basic problem of identifying the changes in the nervous 
system that accompany learning. Consequently this question is left almost 
entirely to theoretical speculation. 

Speculation about neurophysiological events that accompany learning 
Was once a favorite pastime of psychologists and physiologists. Theorists 
were free to invent as they pleased, since ignorance of both learning and 
neurophysiology left ample room for a variety of ideas. The careless and 
uninformed speculation about supposed neural events in learning eventually 
led to disrepute for all neurophysiological theorizing about behavior. Con- 
sequently, a whole generation of psychological theorists all but abandoned 
the nervous system and instead built theories designed to explain behavior 
without reference to neurophysiology. 

Today, both the psychology of learning and the neurophysiology of the 
central nervous system have made considerable advances, and more rea- 
soned neurophysicogical theory is beginning to appear. In this section we 
shall look at some of the current notions about neurophysiological events 
in learning. These ideas are still highly speculative, but we are close enough 
to direct experimental investigation of the matter so that we shall probably 
be able to test some of these notions in the near future. 


Hebb'’s Theory 


Hebb (1949) has put forward an important theory of behavior which 
has had considerable influence in a number of fields, including the psychol- 
ogy of learning. In contrast to most of the psychological theorists of the 
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immediate past, he assigns an important role to neurophysiological hypothe- 
Ses and makes them the central element in his theory. As part of a general 
neurophysiological theory of behavior, Hebb has suggested a theory of 
learning, and from this we can abstract some of the important ideas con- 
cerned with changes in the nervous system. Thus we can see how one Psy- 
chologist has imagined events to take place in the individual nerve cells 
during learning. 

Hebb begins by pointing out that we must have some way of reconciling 
the seemingly incompatible demands of perception and learning. Learning 
apparently requires that specific cells be excited for a specific response to 
be remembered. Perception, on the other hand, seems to demand that pat- 
terns of excitation, rather than specific cells, be the physiological basis for 
Psychological events. We know, for example, that a square looks like a 
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Figure 58. A schematic diagram of a recurrent nerve circuit. 


square whether its image on the retina forms a square or a diamond. Fur- 
thermore, the same square is always immediately recognized as the same 
Square, even if it excites different portions of the retina, and hence different 
portions of the central visual system. 

Hebb solves this dilemma by making learning basic to perception. He 
suggests, for example, that we must learn to perceive the square as a square 
no matter how it is viewed. The problem then remains to postulate the 
manner in which specific cells become the basis for memory. Hebb does 
this by suggesting a dual-trace mechanism— two factors in the development 
of the memory for specific responses. One factor is built on the principle 
Of recurrent nerve circuits (schematically illustrated in Figure 58). We can 
see that excitation started by the first nerve cell in the chain returns to that 
cell through a number of other cells. We know that recurrent circuits are 
active for brief periods of time following stimulation of the central nervous 
system. A recurrent circuit can act to preserve for a brief period of time 
excitation in a particular group of nerve cells. This, It can temporarily 
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hold the memory of some stimulation until a secondary but permanent con- 
nection has developed sufficiently to take over. The secondary change (the 
one responsible, says Hebb, for everything but events such as the imme- 
diate memory span) is the result of growth of nerve fibers toward one 
another at the synapse, or junction, between the nerve cells. He specifically 
suggests that one cell acts upon another to form knoblike growths that make 
the transmission of impulses through one particular group of cells easier 
than through another. The growth of the fibers itself is the result of activity 
of the nerve cells. Thus, when a particular conjunction of external stimuli 
occurs, they are likely to elicit activity in the same group of cells that was 
previously active. The result is a permanent connection between the ex- 
ternal stimuli and events in the central nervous system. 


The Potentiation Effect and Eccles’s Theory 


This second, permanent mechanism for establishing connections between 
cells is very much like a mechanism suggested by the neurophysiologist 
Eccles (1953). Since Eccles derives his notions from direct neurophysio- 
logical evidence, it would appear that we are remarkably close to closing 
the gap between neurophysiology and the psychology of learning. 

Facilitation of one reflex function by the activation of another is a com- 
mon neurophysiological observation. Ordinarily, however, such facilitation 
is limited to the time at which the activation of the second system occurs 
or to a fraction of a second thereafter. Recently, however, neurophysiolo- 
gists (Lloyd, 1950; Eccles and Rall, 1951) have observed a facilitation 
effect which lasts not for a few thousandths of a second but for a good 
many minutes. This effect has been called post-tetanic potentiation. On a 
logarithmic time scale this long-lasting facilitation is well on the way to 
the relatively permanent facilitation of responses to conditioned stimuli 
produced by their being paired with unconditioned stimuli. 

Eccles believes the potentiation effect to occur by a swelling of the end 
knobs at synaptic junctions (this is, of course, similar to Hebb’s notion of 
end-knob growth). Eccles then goes on to develop a theory of conditioning 
built on the mechanism (end-knob swelling) that seems to account for post- 
tetanic potentiation. 

His theory of conditioning concerns the interaction between systems of 
neurons called pathways. These pathways converge on one another in such 
a way that the action of one tends to facilitate action in another. Thus the 
occurrence of excitation in the pathway carrying impulses from the uncon- 
ditioned stimulus facilitates the impulses from the conditioned stimulus so 
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that they carry over to the motor neurons and produce a conditioned re- 
sponse. 

The theory can account for some of the facts of classical conditioning 
(the fact, for example, that the conditioned response is not quite the same 
as the unconditioned response), but it is wide of the mark on many other 
facts. For one thing, the theory clearly predicts that a conditioned response 
should be weakened through disuse, and that, indeed, disuse should be a 
more effective weakening agent than experimental extinction. This is clearly 
contrary to fact, and presents a serious theoretical difficulty. Malmo (1954) 
has suggested that this could be corrected by appealing to a neurophysio- 
logical principle of inhibition (hyperpolarization). Thus it is quite likely 
that additional work may make it possible for the theory to account more 
adequately for the basic facts of conditioning. 

Although the theory does not adequately explain the behavioral facts, it 
is solidly based on neurophysiological concepts, and here lies its importance. 
It is not an ad hoc theory put together to account for the behavioral facts 
of conditioning; it is a theory that attempts a genuine integration of neuro- 
Physiological concepts and the principles of conditioning. It is possible that 
this represents the first real bridge between behavior and the basic laws 
governing the interaction between neurons in the central nervous system. 
At any rate it is difficult to see how the long-lasting facilitation effects found 
in post-tetanic potentiation can be ignored in any future neurophysiological 
theory of conditioning. 


Ethology and the Psychology of Learning 


Experimental psychologists, for the most part, have been interested in 
the study of the basic principles of behavior, particularly as they apply to 
the behavior of human beings. Biologists and naturalists, on the other hand, 
have been interested in the behavior of animals and in the relation between 
animal behavior and biological problems in ecology and evolution. This 
kind of study has come to be known as ethology. Quite recently, the labora- 
tory approach to behavior, characteristic of the experimental psychologists, 
and the field approach, characteristic of the ethologists, have tended to 
come together. In this section we shall deal with some of the relations be- 
tween ethology and the psychology of learning. 


Habituation and Imprinting 


We have described the two basic kinds of learning to be classical con- 
ditioning and instrumental learning. We have, however, restricted our dis- 
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cussion almost entirely to the behavior of mammals. When we consider 
learning in other vertebrates and in invertebrates, the ethologist reminds 
us that there are kinds of learning which do not seem to be easily described 
as either classical or instrumental conditioning. Two other classes of the 
modification of behavior that have been intensively studied by students of 
animal behavior are habituation and imprinting, and we shall consider 
these briefly. 

Habituation. Humphrey (1933) gave us the first clear account of 
habituation and its importance in the animal world. Habituation is learning 
not to respond to stimuli which tend to be without significance in the life 
of the animal (Thorpe, 1956). A familiar example is the description of a 
simple experiment with a snail (Humphrey, 1933). If as the snail moves 
along a board, the board is periodically jerked, there is a reflex withdrawal 
of the tentacles. As the jerking is repeated, however, this withdrawal move- 
ment gradually diminishes and finally stops altogether. If the intensity of 
the stimulus is increased, the response will reappear, only to disappear with 
further repetitions. A period of rest will allow the response to recover spon- 
taneously, but as in the case of the spontaneous recovery of extinction, re- 
peated experimental sessions will finally cause the behavior to disappear 
more or less permanently. This last point is especially important, because it 
is this which differentiates habituation from simple fatigue. 

It is extremely difficult to obtain evidence for associative learning (either 
of the classical conditioning or the instrumental variety) in single-celled 
animals or in very primitive metazoa, such as the coelenterates. On the other 
hand, habituation can be readily demonstrated with every animal (Thorpe, 
1956) and it must be regarded as the most fundamental and elementary 
example of the permanent modification of behavior by repeated stimulation. 
Of course, habituation is very similar to experimental extinction, and it may 
well be that the most elementary examples of extinction are examples of 
habituation of learned rather than unlearned responses. 

Thorpe (1956) points out that habituation is of fundamental importance 
in the natural life of animals. Furthermore, he emphasizes, it is probably 
related to the gradual atrophy of instinctive reactions when they are allowed 
to happen without consummation of the appropriate appetite behind them. 
Thus, both learned and unlearned acts seem to be subject to disappearance 
on repeated elicitation. It is only the activation of the motivation system 
(perhaps centered in the hypothalamus ) that keeps responses alive. In other 
words, widespread habituation of behavior may very well be something 
which has to be overcome by the positive effects of motivation. 
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Imprinting. In recent years there has been a good deal of attention 
given to a peculiar variety of associative learning called imprinting. It was 
originally discovered in birds, but it is now suspected to be quite widespread 
in the animal kingdom. The peculiar features of imprinting are as follows: 
(1) it occurs with spontaneous rather than reflex behavior (though it is be- 
havior “released” by external stimulation), (2) it requires no specific rein- 
forcement for activation, and (3) it seems to occur only during a restricted 
period in the life of very young animals. 

The phenomenon of imprinting first received attention from the work of 
Lorenz (1935), who described a typical observation on imprinting. The 
eggs of a goose are hatched in an incubator and the young are restricted 
to the influence of human beings only; they never see the adults of their own 
species. The result is that the goslings come to follow the human being 
(when he quacks), rather than one of their own species. As we know, gos- 
lings typically follow their mother in a procession. This particular activity 
is not so much the result of the distinctive visual stimuli from the parent 
bird as it is the result of the association of any moving object the bird sees 
with the release of the response of following. This imprinting of the young 
bird appears to be limited to very brief periods in its life (Ramsey and 
Hess, 1954). 

It has sometimes been supposed that imprinting is irreversible both in 
the sense that it is never forgotten and in the sense that once a young bird 
is imprinted to respond to a particular stimulus (say a moving shoe box) 
it cannot be imprinted to other objects. The latter statement is apparently 
untrue; Jaynes (1956) was able to obtain considerable evidence of general- 
ization of imprinting to objects quite dissimilar to the original imprinted 
stimulus in young chicks. 

Imprinting is the acquisition of the ability by new cues to command in- 
nate behavior. Although most of the evidence on imprinting has been ob- 
tained with birds, there is evidence that a similar effect occurs in mammals 
and perhaps in insects (Thorpe, 1956). The great importance of imprint- 
ing in the life of animals is the plasticity, or freedom, in response to stimuli 
that it gives to certain kinds of instinctive activities. We are still uncertain 


‘about how it is related to the more universal kind of associative learning 


exemplified by classical conditioning and instrumental behavior. Until we 
know more about it, imprinting remains something of a curiosity in the 
description of learning. 
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Ethology and Learning 

Thorpe (1956) in his comprehensive survey of the relationships between 
the mechanisms of learning and the natural behavior of animals shows how 
particular aspects of learning such as stimulus generalization, latent learn- 
ing, trial and error, etc., fit into the biological pattern of survival in the 
natural world. He demonstrates how the mechanisms of latent learning work 
in exploratory activity and in the learning of topography in animals as primi- 
tive as insects. 

He describes, for example, the dependence upon learning of an instinctive 
bit of behavior, the egg-laying and provisioning behavior of the hunter Wasp. 
This animal is able to make several burrows in different places, lay eggs in 
each of them, and return later to provide food for the young. The tendency 
to return is itself determined by innate releasing mechanisms (instinct), 
but the location of the burrows must be learned by the animal. The learning 
is latent because the behavior of returning to the burrow is not always ac- 
companied by direct reinforcement. 

Furthermore, Thorpe was able to demonstrate, by controlled observa- 
tions, that this must be the learning of stimulus relations rather than spe- 
cific responses, for these animals can be picked up and released in strange 
locations and still find their way back to the nest (Thorpe, 1950). Figure 
59 shows some of Thorpe’s tests for detour, or barrier-circumventing, be- 
havior in these wasps as well as the effects of release in a strange location. 
Notice the straight and regular path of the wasp to the burrow following its 
release in new surroundings. 

Thorpe (1956) argues that some of the spectacular observations on com- 
munication in bees that have received so much attention recently must indi- 
cate rather complicated latent learning in these animals. Bees can return 
to a hive after locating a source of sugar, and when they return they com- 
municate to other bees the direction and approximate distance of the source. 
The communication is by means of a complicated dance. Dethier (1957) 
has shown that a similar pattern of behavior in the fly can be accounted for 
by direct stimulus-response and physiological mechanisms, and it is very 
likely that the activity of the bee can be accounted for by similar mecha- 
nisms. Thus, spectacular as this behavior is, it is probably not an example 
of learned behavior. 

There is great difficulty in separating the influence of instinct and learn- 
ing in animals. Only now are we beginning to understand how these things 
interact in the natural world. Most animal behavior is a complicated inter- 
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Figure 59. Shows the route taken during detour and displacement experiments on an individual 
wasp. The heavy line shows the course of the wasp. A is the point at which it was first observed. The 
numbers 1 to 4 indicate the points at which metal screens were placed in the wasp's path to force 
detouring. The insect was captured at B. It was transported over the dotted line and released at 
Point C. The remaining numbers indicate further detour experiments. N is the nest, finally reached 
by the insect. The time taken by the insect was approximately 15 minutes. (Thorpe, 1956.) 


weaving of instinctive and learned components. Once an instinctive act has 
been released, it has opened the way for possible modification. Certain 
kinds of learning depend upon instinctlike mechanisms (imprinting), and 
SO it is safe to say that learning is modified by instinct as much as instinct 
is modified by learning. 


LEARNING APPLIED TO SOCIAL AND PSYCHOLOGICAL PROBLEMS 


We have had little to say—and that only incidentally—about the appli- 
cation of the psychology of learning to general social and psychological 
problems. More often than not, the worker in the psychology of learning 
has had in mind only remotely, if at all, an eventual application of his work 
to the more pressing problems of human behavior. Nevertheless, the appli- 
cation has gone forward, sometimes wisely and sometimes not. A few 
theorists and experimentalists have argued that the justification for the 
theoretical study of learning lies in its application, and some of these psy- 
chologists have enthusiastically applied the experimental evidence and 
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theories of learning to problems in personality development and social 
relations. 

Unfortunately, there is some confusion about the application of learning 
theory, because some ambiguity exists in the meaning of learning theory. 
In the past generation, theories of behavior have come to be identified 
with theories of learning. Thus, we find B. FE. Skinner generally classified 
as a learning theorist, though he is very little concerned with learning and 
memory. With considerably more justification he could be called an in- 
centive theorist. We shall try to avoid this identification of learning with 
the general theoretical principles of behavior and try to restrict our discus- 
sion to associative problems. Of course, we shall not be able to ignore en- 
tirely motivational and emotional problems, since these frequently turn out 
to be based on associations between behavior and its consequences. 

The applications of the psychology of learning are numerous, and we 
can discuss only a few representative examples from the fields of personal- 
ity, social development, and educational psychology. These should give us 
some indication of the direction in which applications of learning have gone. 


Problems in Personality and Social Development 


In recent years there have been many attempts to apply the basic data 
and theories of learning to the problems of human personality. One of the 
earliest is found in Guthrie’s Psychology of Human Conflict (1938). Guthrie 
attempted to account for problems in personality and personal conflict by 
relying almost exclusively on classical conditioning as the model for all 
kinds of behavior, reflex and instrumental, emotional and intellectual. 
Guthrie's attempt is distinguished by ingenuity and style, but it is more 
of a tour de force than a serious attempt to cope with the complicated prob- 
lems of human behavior in terms of associative learning. 

More recently, learning theorists have attempted to deal with more spe- 
cific problems, such as the course of Psychotherapy. Furthermore, these 
attempts, unlike Guthrie's, make some verifiable predictions about the 
problems with which they deal. In the next few pages we shall examine 
some of these applications of the basic learning Principles to the analysis 
of personality and psychotherapy. 

One of the most persistent efforts of the immediate past has been to 
achieve a synthesis of psychoanalytic and learning theories. This has in- 
triguing possibilities, and since, as has often been pointed out, Freud was 
basically an associationist, it has not been so difficult a task as might be 
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supposed. Much of the work we shall discuss has been written from this 
synthetic psychoanalytic-learning point of view. 


Personality and Psychotherapy 


Shoben (1949) suggested that one of the major goals of psychotherapy 
is the acquisition of symbolic control over behavior. The process of psycho- 
therapy is described by Shoben as the lifting of repression and the develop- 
ment of insight through symbolic reinstatement of the stimuli for anxiety, 
the diminution of anxiety by counter conditioning, and the process of re- 
education with the help of the therapist. Within this framework are many 
problems directly related to the principles of learning. 

Dollard and Miller. One of the most convincing attempts to apply the 
Principles of learning to personality and psychotherapy is demonstrated in 
Dollard and Miller's Personality and Psychotherapy (1950). The authors 
Write from the point of view of Hull's general theory of behavior and 
learning, but save for a few points, they could have written their book 
from the point of view of the purely empirical study of learning. Much of 
their book is devoted to an analysis of a case history and the course of 
Psychotherapy in this case. This gives their book an extremely realistic 
flavor, but it only begins to suggest the possibilities of the analysis of psy- 
chotherapy from the point of view of the learning process. 

Dollard and Miller devote some space to outlining the basic principies 
of conditioning and learning. They then show how these principles can ac- 
Count for the acquisition of fears and anxieties in infancy and childhood. 
They argue, for example, that if a baby becomes very hungry when it is 
alone and cries and frets while the hunger drive increases, when it is finally 
fed there will be a large-scale reduction of need. These responses of crying, 
fretting, etc., are reinforced and thus become the characteristic responses 
Of the child when left alone. 

The bulk of their book is devoted to the consideration of psychotherapy 
as a learning process and to an analysis of the case illustrating the role of 
mechanisms of learning in psychotherapy. Dollard and Miller start out by 
Characterizing the neurotic person as someone who learns inadequate ways 
of thinking and dealing with himself. The inability to think about his basic 
Problems may make the neurotic appear to be stupid; the Pressures of con- 
flict make him feel miserable. Neurotic activities and psychosomatic symp- 
toms are the ways in which he has learned to alleviate his misery partially. 


8 See the section in Chap. 6 on Dollard and Miller. 
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The business of psychotherapy, according to Dollard and Miller, is to 
correct the faulty habits the neurotic has learned. The therapist sets up a 
warm and permissive atmosphere in which the neurotic’s anxieties and fears 
can be symbolically introduced and allowed to extinguish. The therapist 
arouses memories of anxiety-inducing events in the neurotic’s life history, 
but it is done in the permissive atmosphere of therapy, so that these fears 
and anxieties are unreinforced and may therefore be extinguished. 

One of the problems for the therapist is to attach labels to things which 
the neurotic has lost the ability to deal with mentally. Words and sentences 
are attached to emotions formerly repressed. The neurotic can then deal 
more adequately with his inner feelings and experiences and he is made 
capable of dealing with his conflicts at an overt, conscious level. This is 
essentially the process of discrimination, and these two processes, extinc- 
tion of fear and anxiety and the development of discrimination, are the 
essential contributions of psychotherapy. 

Dollard and his associates (Dollard, Auld, and White, 1953) have done 
more than simply present the process of psychotherapy as interpreted by 
learning theory; they have suggested techniques for the evaluation of ther- 
apy and the analysis of its progress based upon this interpretation. Thus, 
such an approach to psychotherapy is of considerable potential value to the 
practical clinical psychologist. 

Mowrer. A similar analysis of personality disorder and psychotherapy 
has been presented by Mowrer. More recently, however, Mowrer has ex- 
panded his point of view to encompass a vast multitude of scientific and 
moral problems in the development of personality and psychotherapy. He 
has considerably broadened the concepts of learning to be applied to the 
analysis and description of personality. He makes much more use of a 
concept of mediational activity (or “pure stimulus act,” as Hull has called 
it) than do Dollard and Miller. For example, he sees thinking as a form of 
activity in which the organism makes symbolic responses to find out what 
effects will occur if the action is really carried out (Mowrer, 1954). 
Thought, says Mowrer, is preparation for action, fantasy is substitution for 
action. Therefore, although he finds a role for the habit processes much 
like those suggested by Shoben and by Dollard and Miller, he finds the be- 
havior of the individual in psychotherapy much more complicated than 
learning theory allows. 

Indeed, learning theory does offer a most mechanical and limited model 
of the individual in difficulty. Since some model is better than none at all, 


+ See, for example, Mowrer (1950). 
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Mowrer and others of his general viewpoint run the danger of losing the 
precision and clarity of a simple model by trying, at this stage, to schematize 
the most subtle aspects of personality and its development. It is perhaps 
good that learning models of personality action and psychotherapy are 
rather simple, for they encourage us to concentrate on solving a few ele- 
mentary problems. The richer, more luxuriant theories of personality, on 
the other hand, so overwhelm us with the complexities of the problems to 
be dealt with, that they discourage investigation. Thus, one of the real con- 
tributions of learning to the study of personality should be the simplifica- 
tion it introduces. 


Learning and Socialization 

Children must learn to become adequate adults in the different societies 
to which they belong. Societies differ from one another considerably in the 
extent and kinds of demands they make upon individuals. Consequently, 
Cross-cultural studies of socialization in different societies present an impor- 
tant challenge to the psychology of learning. 

Unfortunately, since such a study is just beginning, we cannot expect to 
find well worked out theoretical applications of the principles of learning. 
The work of Whiting and Child (1953) has suggested ways of studying the 
influence of infant and child care on the characteristics of adults in different 
Societies. 

Child (1954) points out that the learning of infants and children is more 
broadly generalized than that of older individuals. This is because young 
Children are preverbal and thus do not have the mediating responses for per- 
forming fine discriminations. Such behavior is less susceptible to extinction 
and verbal control. Secondly, Child points out, the infant is helpless and 
hence is often subjected to drives of great intensity. Therefore, drive reduc- 
tion is a basic and powerful element in the control of behavior of very 
Young infants. Thus the stage is set for the learning of powerful emotional 
Teactions and the setting up of secondary drives. The problems of early 
learning, such as those encountered in toilet training, weaning, and control 
Of aggression, should be powerful influences in shaping the personality of 
individuals. f . | 

Seward (1954) attempts a rigorous derivation from basic learning prin- 
ciples of the role of reinforcement in developing identification in children. 
Identification is the process of learning the values of others, particularly 
those of the parent. Seward deduces that the kind of identification devel- 
Oped by punishment will be closely linked to the kinds of behavior actually 
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punished, while approval will tend to produce identification with much 
wider generalization. 

The role of reinforcement is only one of the many mechanisms that 
work in the kind of socialization that leads to identification, but it is an 
important one. Notions like Seward’s have implications for a cross-cul- 
tural analysis of socialization, for it is possible to examine cultures which 
differ in the way in which identification is induced to see what differences 
are produced in the kinds of identification achieved. 


Learning and Theories of Personality 


It is encouraging to see that the general concepts which have arisen 
from the study of learning are now being incorporated into the body of 
personality theory. This is partly because of the close parallel between cer- 
tain classical psychoanalytic notions and some basic concepts in learning. 
Over and above this, however, there has been a genuinely new appreciation 
of the value of some of the familiar ideas in the study of learning for the 
understanding of personality development. 

Rotter (1954), for example, has presented a rather detailed theory of 
personality development from the point of view of social learning. He adds 
anew dimension to the understanding of personality by means of a concept 
of reinforcement value. He reminds us that not only are different reinforce- 
ments effective with different people, but that for any individual or group 
of individuals there is an organized hierarchy of values of reinforcement. 
Furthermore, these hierarchies evolve with experience, and this kind of 
change is one aspect of personality development. 

Rotter shows that the reinforcement value of a certain reward is to a 
considerable extent determined by the individual’s anticipation of the pat- 
tern or frequency of future reward. This notion allows Rotter to reinterpret 
the role of delay of reinforcement in maladjustive behavior. Various PSy- 
chologists have supposed that maladjusted individuals, criminals, and neu- 
rotics are marked by an inability to act on the basis of delayed reinforce- 
ments; such people seek immediate gratification. Thus, says Rotter, if we 
were to offer a group of children the choice of a penny candy today or a 
5 cent candy tomorrow, at least some of the children would choose the 
penny candy today. From Rotter’s point of view, such a choice is not be- 
cause the delayed reinforcement has a lesser value, but because some indi- 
viduals have learned that reinforcements promised for the future are not 
likely to occur. 

Rotter and other theorists have proposed a large number of mechanisms 
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derived from basic studies of learning to account for various aspects of per- 
sonality development. These concepts should have the effect of leading to 
fruitful experimental investigation of personality mechanisms. 


Problems in Education 


At the beginning of the twentieth century there was much hope that the 
experimental study of learning would lead, in a fairly short time, to a scien- 
tific analysis of the problems of school and teaching. Many early books on 
learning were greatly concerned with the applications of existing facts to 
Problems in education. In a very real sense, some valuable contributions 
Were made to the practice of education. The early studies of transfer, par- 
ticularly those of Thorndike and Woodworth (1901), were influential in the 
decline of the notion of formal discipline. All in all, however, the experi- 
mental study of learning did not live up to the early hopes for application 
that it aroused. By 1930, workers in this field disclaimed any attempt to 
achieve results which would have educational significance. At the same 
time a great interest in theories of learning arose, and these were mostly 
Concerned with the behavior of animals. 


Early Problems in Educational Psychology 

The influence of E. L. Thorndike sat heavily upon educational psychol- 
Ogy for many years. Thorndike’s particular interpretation of learning— 
Which he later called connectionism— provided the core for the account of 
learning to be found in early textbooks on educational psychology. The 
Principle of effect and trial-and-error behavior were the two central themes 
in early theories of learning. 

There is nothing wrong with an account of reinforcement and trial-and- 
error processes in a textbook in educational psychology, except that these 
things by themselves do not make a complete and convincing picture ‘of 
the learning of children. One of the outcomes of the disappointment with 
the classical Thorndikian point of view was that a formal discussion of the 
Psychology of learning all but disappeared from Becounts of educational 
Psychology. Theories of learning contained so little of interest to educators 
that such theories were relegated to an extremely NG role in the PSy- 
chology of education. Since theories of learning had little or nothing of 
Practical value to contribute, their place in the textbooks was taken by new 
Material on personality development, mental hygiene, eto: | ৷ 

Of course, the basic applied material on learning remained. This was 
mainly an account of the conditions which promote efficient classroom 
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learning. The textbooks discussed massed and distributed practice, the effect 
of recitation on memory, and motives in classroom learning. There was 
little of theoretical interest discussed; the effect of recitation on memory 
was simply presented as a fact, and its interpretation in terms of recoding 
and transfer was ignored. 

Transfer of training (although not always Thorndike’s concept of iden- 
tical elements) remained as the most basic contribution of learning to edu- 
cational psychology. This emphasis gradually changed, however. The ear- 
lier use of transfer was to further the social-utility movement in education. 
The apparent lack of transfer from the more traditional school subjects to 
the real and important concerns of human life was sufficient reason for de- 
creasing the emphasis upon these subjects. The curriculum was revised so 
that the traditional subjects would be more likely to transfer to the prob- 
lems of daily living. 

More recently, we have come to realize that maximizing transfer is some- 
times less important than other considerations. It is perhaps more valuable 
to produce a limited amount of transfer from some kind of activity that 
can go on only in the schools than to produce a lot of transfer that dupli- 
cates the activity of other agencies of the community. Consequently, the 
emphasis has changed from one of revising the curriculum to maximize 
transfer to one of maximizing transfer within the framework of a curricu- 
lum determined by the needs of the schools. 


Current Problems in Educational Psychology 


Much of educational psychology today is concerned with problems of 
personality development and individuality. Since there has been a growing 
interest in the application of theories of learning to individual development 
and socialization, much of the fundamental theoretical and experimental 
work in the psychology of learning is now returning to educational psy- 
chology, albeit with a different emphasis. 

In addition, of course, there have been many developments within edu- 
cational psychology itself. The dissatisfaction with the traditional material 
in learning and the conflict between the experimentalists, who insist upon 
working with pure variables, and the educators, who find that this produces 
little of direct application, are still with us. A few individuals, however, 
have been working at bringing together the experimental approach to learn- 
ing and the problems of classroom learning. Auble and Mech (1953), for 
example, deal with the application of partial reinforcement to behavior in 
the classroom. Their results suggest that reinforcement variables must be 
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treated with extreme caution in applied problems, since children are able 
to mediate their own reinforcements. The influence of intraverbal processes 
in children, the nature of the social situation in the classroom, and other 
conditions make the application of simple variables to classroom learning 
extremely difficult. The current attention of many experimentalists in class- 
room learning is now directed toward the study of the interaction between 
social and personality variables and the variables that control learning. 

Despite the many difficulties, Skinner (1954) has recommended a 
straightforward application of the reinforcement technique for the controi 
of behavior to the classroom. He asks, What reinforcements are available 
for working with the child in school? The answer is in all of the things— 
Scissors, puzzles, noise makers—which feed back to the child changes in the 
environment and which are free of aversive, or negatively reinforcing, prop- 
erties. Skinner argues that in the schoolroom today much of the positive 
reinforcement from sheer control of the environment is masked by the 
emotional responses generated by aversive conditions. 

Skinner even goes so far as to recommend automatic devices for the con- 
trol of reinforcement contingencies, since, he argues, the personal control 
of reinforcement by a teacher is too erratic and irregular. The free use of such 
devices, says Skinner, would free the teacher to do what she is best suited for 
— deal with the socialization of the individual child. 

Although Skinner’s suggestions have outraged many people, some have 
been impressed. The kinds of techniques he suggests for the control of be- 
havior (and hence of learning) are now undergoing systematic testing. In 
the next few years we should be able to evaluate them properly. 

Recent years have seen little in the way of large-scale theories of educa- 
tional psychology. Perhaps Skinner's efforts will point in this direction. In 
a somewhat different direction, Stephens (1951, 1956) has advanced a 
theory of schooling built on the presupposition that there are many basic 
nondeliberative (one might almost say instinctive) factors underlying the 
educative process, and these, more than any particular method or practice 
in education, are the things that determine how schooling happens. 

Stephens started out originally with a hypothesis to account for the large 
literature of negative findings in the field of experimental comparison of 
the different “methods” of teaching. These appear to make little or no 
difference in the outcome of education. This, Stephens argues, is so because 
different methods can only slightly modify, not introduce or eliminate, some 
of the fundamental spontaneous factors in education. He argues that such 
matters as play, discursive conversation, tolerance of unusually communica- 
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tive people, and the fundamental operations of reinforcement—all work no 
matter what the specific method. The method supplies direction and empha- 
sis, but the fundamental work is done by the nondeliberative factors. In- 
deed, these factors are responsible for the institution of schooling itself. 

This is radically different from the kind of theory currently predominant 
in psychological and sociological discussions of education, but it makes 
much of our knowledge of the pervasiveness of learning as an evolutionary 
mechanism and of the innately reinforcing properties of nonappetitive re- 
wards, which have received much attention lately. The introduction of such 
a theory seems to indicate a reawakening of interest in the fundamental 
connection between learning and educational theories. 


IN CONCLUSION 


In this short book we have followed a complicated path through the ex- 
perimental Ue On learning. It is disturbing to think that many read- 
ers will be so lost in the details that they will fail to see the shape of things 
as a whole. In these last few paragraphs we shall attempt to give an over-all 
contour to what we have said. 

We have looked upon learning as a primarily associative process. The 
fundamental variables that control learning are temporal contiguity, simi- 
larity (both among responses, implicit and explicit, and among stimuli), 
repetitions of stimuli and responses, interaction between learned responses, 
and learned mediating responses. We have seen that the motive-incentive 
variables such as drive, emotion, and reinforcement (including time and 
pattern of reinforcement) determine the behavior of organisms. In so far 
as organisms learn what they do, these variables determine learning. Fur- 
thermore, a fundamental class of associations in the animal world is that 
Of responses with reinforcing agencies. We have seen, however, that some- 
times when organisms behave without specifically channeled motives or 
well-defined reinforcements they still learn. Thus, these motive-incentive 
conditions which guide and control behavior are not essential to learning. 
The essential features of learning are associations among stimuli and among 
stimuli and responses as these occur in the experience of the organism. 

‘The variables determining learning interact with species and individual 
differences among organisms, and they are limited by the capacities of ihe 
organisms. We have seen the limitation that the immediate memory span 
imposes on human memory and how this limitation is circumvented by repe- 
tition and recoding. We have seen how intraorganic symbolic processes are 
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necessary for certain kinds of “insightful” problem solving and how the 
lack of such processes produces an inevitable limit to the problem-solving 
ability of certain animals. 

We have also discussed the role of language in recoding man’s experi- 
ence. We have had little to say about the learning of linguistic habits them- 
selves. Such habits are, to some extent at least, associative; they are associa- 
tive in the sense that the learning of nonsense syllables provides a convenient 
and crude model for the development of such habits. Linguistic habits, how- 
ever, are more than simply chains of probabilistic associations. Human lan- 
guage has a grammar whose existence makes the learning of natural linguistic 
habits very different from the learning of nonsense syllables. 

Grammar is not purely associationistic. There are certain elementary 
“forms” in grammar into which we can place words, ideas, even new words 
and new ideas that have never been used before. We can easily recognize 
the elements of the common forms of our mother tongue, even though non- 
sense may be put into those forms. “Gelix frimmaged a leble.” This can be 
identified by most intelligent users of English as a simple, declarative sen- 
tence. It is not particularly difficult to identify the subject, the verb, the 
object, and the syntax in which these appear. Part of our learning of lan- 
guage is learning the sequential and semantic associations that are the com- 
mon property of the community, but another part is the learning of linguis- 
tic forms into which we can place our words. 

We have had little or nothing to say about how such grammatical forms 
are learned and used, for the very good reason that this is one of the prob- 
lems in verbal learning that has remained almost untouched. It is one of 
the really few virginal territories left for research in verbal learning, and it 
is to be recommended to anyone who wishes to make a fundamental con- 
tribution to the understanding of the human mind and its verbal processes. 
Therefore, the great emphasis upon sequential and mediational associative 
processes we have made in this book needs some correction. These associa- 
tions are important, but associations imposed within grammatical forms are 
also important, and we have as yet few tools for dealing with them. 

In our discussion of verbal learning we had little to say about meaning, 
mainly because what can be said is not primarily a problem in learning. 
Suffice it to say that the prevailing view in psycholinguistics today is that 
meaning has its roots in association. The meaning of a word, a phrase, or 
any kind of verbal unit is determined by the associations of that unit with 
other verbal and nonverbal responses and with external stimuli generally. 
Meanings, indeed, seem frequently to be little more than the bridge between 
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one associative cluster and another. Thus, the general question of meaning 
is likely to get lost in the details of recoding, associative clustering, and 
stimulus generalization. 

There are many points at which the study of learning crosses the other 
disciplines concerned with man’s mind and its works. Language, neuro- 
physiology, personality development, and social anthropology—all cross in 
different ways. It we have achieved some understanding of the way in 
which organisms learn, we shall carry to these other disciplines something 
of fundamental importance. 

One basic idea that has come out of the systematic study of learning is 
the attitude that some of the complexities of human life and society can 
be understood in terms of the operation of basic learning laws. If the impli- 
cations of this idea have begun to stir, we will have accomplished something. 

Always, of course, a discipline such as the psychology of learning prom- 
ises more than it can achieve. It leaves a model of the organism that is 
stiff, mechanical, and only a faint copy of reality. It is, however, better 
than no model at all, and if we are to understand the behavior of organisms 
in an abstract, scientific sense we badly need such a model. The course of 
future work in learning and in the basic theory of behavior that goes along 
with it will be in the direction of introducing some flexibility and life into 
the model. At the same time, perhaps paradoxically, the model will proba- 
bly become more abstract. In other words, certain concepts we now use, 
which are crude physical models, will tend to become more and more 
abstract and mathematical in character. This dual process of introducing 
flexibility and abstractness into the behavioral model of organisms appears 
to be the fundamental task of future research in the psychology of learning. 

It also is beginning to appear more likely in fact, rather than merely in 
hope, that the psychology of learning will begin to approach a unity of con- 
cepts with other disciplines. Thus the fundamental associative concepts of 
the future will probably be stated in such a way that they have some neuro- 
physiological as well as behavioral validity. All this is in the future, how- 
ever, and at present our horizons are limited by our ability to see only the 
dimmest connections between learning and the nervous system and between 
learning and man's best intellectual efforts. 

What has gone into this book is a short account of the present status of 
the psychology of learning. Like all healthy, growing scientific efforts, it is 
changing continuously. Some of the gaping deficiencies and points of ob- 
scurity and downright error will soon be corrected, but it is unlikely that 
the main outlines will change very much in the near future. 
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